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About  the  Textbook 


A  Concise  Introduction  to  Logic  is  an  introduction  to  formal  logic  suitable  for  undergraduates 
taking  a  general  education  course  in  logic  or  critical  thinking,  and  is  accessible  and  useful 
to  any  interested  in  gaining  a  basic  understanding  of  logic.  This  text  takes  the  unique 
approach  of  teaching  logic  through  intellectual  history;  the  author  uses  examples  from 
important  and  celebrated  arguments  in  philosophy  to  illustrate  logical  principles.  The  text 
also  includes  a  basic  introduction  to  findings  of  advanced  logic.  As  indicators  of  where  the 
student  could  go  next  with  logic,  the  book  closes  with  an  overview  of  advanced  topics,  such 
as  the  axiomatic  method,  set  theory,  Peano  arithmetic,  and  modal  logic.  Throughout,  the 
text  uses  brief,  concise  chapters  that  readers  will  find  easy  to  read  and  to  review. 


Reviewer's  Notes 

Adam  Kovach 


True  to  its  name,  A  Concise  Introduction  to  Logic,  by  Craig  DeLancey,  surveys  propositional 
logic  and  predicate  logic  and  goes  on  to  introduce  selected  advanced  topics,  in  little  over 
200  pages.  The  book  provides  an  integrated  presentation  of  basic  syntactic  and  semantic 
concepts  and  methods  of  logic.  Part  I  starts  with  the  concept  of  a  formal  language.  The 
concept  of  valid  inference,  truth  tables  and  proofs  are  introduced  immediately  after  the 
first  two  propositional  connectives.  Connectives  and  inference  rules  are  introduced  in 
alternation,  to  develop  a  complete  simple  natural  deduction  system  for  propositional  logic. 
Part  II,  adds  the  apparatus  of  quantification  and  proof  rules  for  a  complete  predicate  logic. 
The  text  covers  the  logic  of  relations,  sentences  with  multiple  quantifiers  and  Russell’s 
theory  of  definite  descriptions.  The  presentation  of  concepts  and  principles  is  orderly, 
clear  and  thought  provoking.  Many  topics  are  introduced  with  examples  of  philosophical 
arguments  drawn  from  classic  sources,  adding  depth  of  knowledge  to  an  introductory 
course.  The  first  two  parts  end  with  systematic  overviews.  The  focus  is  on  formal  deductive 
logic  throughout.  Informal  fallacies  and  traditional  syllogistic  logic  are  not  covered. 
Advanced  topics  covered  in  the  final  part  of  the  text  include  an  axiomatic  approach  to 
logic,  mathematical  induction,  a  deduction  theorem  for  propositional  logic,  and  brief 
introductions  to  set  theory,  modal  logic  and  number  theory. 

The  reviewer,  Adam  Kovach,  is  Associate  Professor  of  Philosophy  at  Marymount  University  in 
Arlington,  V A,  where  he  teaches  courses  in  many  subjects  including  logic. 


0.  Introduction 


0.1  Why  study  logic? 

Logic  is  one  of  the  most  important  topics  you  will  ever  study. 

“How  could  you  say  such  a  thing?”  you  might  well  protest.  And  yet,  consider:  logic 
teaches  us  many  things,  and  one  of  these  is  how  to  recognize  good  and  bad  arguments. 
Not  just  arguments  about  logic — any  argument. 

Nearly  every  undertaking  in  life  will  ultimately  require  that  you  evaluate  an  argument, 
perhaps  several.  You  are  confronted  with  a  question:  Should  I  buy  this  car  or  that  car? 
Should  I  go  to  this  college  or  that  college?  Did  that  scientific  experiment  show  what  the 
scientist  claims  it  did?  Should  I  vote  for  the  candidate  who  promises  to  lower  taxes,  or  for 
the  one  who  says  she  might  raise  them?  And  so  on.  Our  lives  are  a  long  parade  of  choices. 
When  we  try  to  answer  such  questions,  in  order  to  make  the  best  choices,  we  often  have 
only  one  tool:  an  argument.  We  listen  to  the  reasons  for  and  against  various  options,  and 
must  choose  between  them.  And  so,  the  ability  to  evaluate  arguments  is  an  ability  that  is 
very  useful  in  everything  that  you  will  do — in  your  work,  your  personal  life,  your  deepest 
reflections. 

If  you  are  a  student,  note  that  nearly  every  discipline,  be  it  a  science,  one  of  the  humanities, 
or  a  study  like  business,  relies  upon  arguments.  Evaluating  arguments  is  the  most 
fundamental  skill  common  to  math,  physics,  psychology,  literary  studies,  and  any  other 
intellectual  endeavor.  Logic  alone  tells  you  how  to  evaluate  the  arguments  of  any 
discipline. 

The  alternative  to  developing  these  logical  skills  is  to  be  always  at  the  mercy  of  bad 
reasoning  and,  as  a  result,  you  will  make  bad  choices.  Worse,  you  will  always  be 
manipulated  by  deceivers.  Speaking  in  Canandaigua,  New  York,  on  August  3,  1857,  the 
escaped  slave  and  abolitionist  leader  Frederick  Douglass  observed  that: 

Power  concedes  nothing  without  a  demand.  It  never  did  and  it  never  will.  Find  out 
just  what  any  people  will  quietly  submit  to  and  you  have  found  out  the  exact 
measure  of  injustice  and  wrong  which  will  be  imposed  upon  them,  and  these  will 
continue  till  they  are  resisted  with  either  words  or  blows,  or  with  both.  The  limits 
of  tyrants  are  prescribed  by  the  endurance  of  those  whom  they  oppress. 

We  can  add  to  Frederick  Douglass’s  words  that:  find  out  just  how  much  a  person  can 
be  deceived,  and  that  is  just  how  far  she  will  be  deceived.  The  limits  of  tyrants  are  also 
prescribed  by  the  reasoning  abilities  of  those  they  aim  to  oppress.  And  what  logic  teaches 
you  is  how  to  demand  and  recognize  good  reasoning,  and  so  how  to  avoid  deceit.  You  are 
only  as  free  as  your  powers  of  reasoning  enable. 

0.2  What  is  logic? 

Some  philosophers  have  argued  that  one  cannot  define  “logic”.  Instead,  one  can  only  show 
logic,  by  doing  it  and  teaching  others  how  to  do  it.  I  am  inclined  to  agree.  But  it  is  easy 
to  describe  the  benefits  of  logic.  For  example,  in  this  book,  you  will  learn  how  to: 
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•  Identify  when  an  argument  is  good,  and  when  it  is  bad; 

•  Construct  good  arguments; 

•  Evaluate  reasons,  and  know  when  they  should,  and  should  not,  be 
convincing; 

•  Describe  things  with  a  precision  that  avoids  misunderstanding; 

•  Get  a  sense  of  how  one  can  construct  the  foundations  of  arithmetic; 

•  Begin  to  describe  the  meaning  of  “possibility”  and  “necessity”. 

That  is  by  no  means  a  complete  list  of  the  many  useful  things  that  logic  can  provide.  Some 
of  us  believe  that  logic  and  mathematics  are  ultimately  the  same  thing,  two  endeavors 
with  the  same  underlying  structure  distinguished  only  by  different  starting  assumptions. 
On  such  a  view,  we  can  also  think  of  logic  as  the  study  of  the  ultimate  foundations  of 
mathematics.  (This  is  a  reasonable  characterization  of  logic,  but  those  afraid  of  mathematics 
need  not  fear:  logic  must  become  quite  advanced  before  its  relation  to  mathematics 
becomes  evident.) 

Ultimately,  the  only  way  to  reveal  the  beauty  and  utility  of  logic  is  to  get  busy  and  do  some 
logic.  In  this  book,  we  will  approach  the  study  of  logic  by  building  several  precise  logical 
languages  and  seeing  how  we  can  best  reason  with  these.  The  first  of  these  languages  is 
called  “the  propositional  logic”. 


0.3  A  note  to  students 

Logic  is  a  skill.  The  only  way  to  get  good  at  understanding  logic  and  at  using  logic  is  to 
practice.  It  is  easy  to  watch  someone  explain  a  principle  of  logic,  and  easier  yet  to  watch 
someone  do  a  proof.  But  you  must  understand  a  principle  well  enough  to  be  able  to  apply 
it  to  new  cases,  and  you  must  be  able  to  do  new  proofs  on  your  own.  Practice  alone  enables 
this. 

The  good  news  is  that  logic  is  easy.  The  very  goal  of  logic  is  to  take  baby  steps,  small  and 
simple  and  obvious,  and  after  we  do  this  for  a  long  while  we  find  ourselves  in  a  surprising 
and  unexpected  new  place.  Each  step  on  the  way  will  be  easy  to  take.  Logic  is  a  long 
distance  walk,  not  a  sprint.  Study  each  small  step  we  take,  be  sure  you  know  how  to  apply 
the  related  skills,  practice  them,  and  then  move  on.  Anyone  who  follows  this  advice  can 
master  logic. 


0.4  A  note  to  instructors 

This  book  incorporates  a  number  of  features  that  come  from  many  years  of  experience 
teaching  both  introductory  and  advanced  logic. 

First,  the  book  moves  directly  to  symbolic  logic.  I  don’t  believe  that  informal  logic  is  worth 
the  effort  that  it  requires.  Informal  logic  largely  consists  of  memorization  (memorizing 
seemingly  disconnected  rules,  memorizing  fallacies,  and  so  on).  Not  only  is  this  sure  to  be 
the  kind  of  thing  that  students  will  promptly  forget,  but  it  completely  obscures  the  simple 
beauty  of  why  the  various  rules  work,  and  why  the  fallacies  are  examples  of  bad  reasoning. 
A  student  who  learns  symbolic  logic,  however,  is  learning  a  skill.  Skills  are  retained  longer; 
they  encourage  higher  forms  of  reasoning;  and  they  have  far  more  power  than  a  memorized 
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list  of  facts.  Once  one  can  recognize  what  makes  an  argument  good,  one  can  recognize  the 
fallacies,  regardless  of  whether  one  has  memorized  their  names. 

Second,  this  book  focuses  on  some  of  the  deeper  features  of  logic,  right  at  the  beginning. 
The  notions  of  semantics  and  syntax  are  introduced  in  the  first  chapter.  Ideas  like  theorem, 
and  a  model,  are  discussed  early  on.  My  experience  has  shown  that  students  can  grasp  these 
concepts,  and  they  ultimately  pay  oft  well  by  greatly  expanding  their  own  understanding. 

Third,  this  book  uses  examples,  and  constructs  problems,  from  our  intellectual  history  in 
order  to  illustrate  key  principles  of  logic.  The  author  is  a  philosopher,  and  understands  logic 
to  be  both  the  method  of  philosophy  and  also  one  of  the  four  fundamental  sub-disciplines  of 
philosophy.  But  more  importantly,  these  examples  can  do  two  things.  They  make  it  clear 
that  arguments  matter.  Weighty  concerns  are  discussed  in  these  arguments,  and  whether 
we  accept  their  conclusions  will  have  significant  effects  on  our  society.  Seeing  this  helps 
one  to  see  the  importance  of  logic.  These  examples  can  also  make  this  book  suitable  for  a 
logic  course  that  aims  to  fulfill  a  requirement  for  an  introduction  to  the  history  of  thought, 
an  overview  of  Western  civilization,  or  the  knowledge  foundations  of  a  related  discipline. 

Fourth,  I  follow  a  no-shortcuts  principle.  Most  logic  textbooks  introduce  a  host  of 
shortcuts.  They  drop  outer  parentheses,  they  teach  methods  for  shrinking  truth  tables, 
and  so  on.  These  moves  often  confuse  students,  and  for  no  good  reason:  they  have  no 
conceptual  value.  I  suspect  they  only  exist  to  spare  the  impatience  of  instructors,  who 
would  like  to  write  expressions  and  truth  tables  more  quickly.  In  this  book,  except  in  the 
last  chapter  that  looks  to  advanced  logic,  we  will  not  introduce  exceptions  to  our  syntax, 
nor  will  we  spend  time  on  abridged  methods.  The  only  exception  is  writing  “T”  for 
true  and  “F”  for  false  in  truth  tables. 

Fifth,  this  book  includes  a  final  chapter  introducing  some  advanced  topics  in  logic.  The 
purpose  of  this  chapter  is  to  provide  students  with  some  understanding  of  the  exciting 
things  that  they  can  study  if  they  continue  with  logic.  In  my  experience,  students  imagine 
that  advanced  logic  will  be  just  more  proofs  in  first  order  logic.  Giving  them  a  taste  of  what 
can  come  next  is  valuable.  My  hope  is  that  this  chapter  will  motivate  students  to  want  to 
study  more  logic,  and  also  that  it  can  serve  as  a  bridge  between  their  studies  in  basic  logic 
and  the  study  of  advanced  logic. 

Finally,  about  typesetting:  quotation  is  an  important  logical  principle,  and  so  I  adopted  the 
precise  but  comparatively  rare  practice  of  putting  punctuation  outside  of  quotes.  This  way, 
what  appears  in  the  quotations  is  alone  what  is  being  defined  or  otherwise  mentioned.  I 
use  italics  only  to  indicate  the  meaning  of  a  concept,  or  to  distinguish  symbolic  terms  of  the 
object  language  from  functions  of  the  object  language.  Bold  is  used  to  set  aside  elements 
of  our  metalanguage  or  object  language. 

0.5  Contact 

The  author  would  appreciate  any  comments,  advice,  or  discoveries  of  errata.  He  can  be 
contacted  at:  craig.delancey@oswego.edu 
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Part  I:  Propositional  Logic 


1 .  Developing  a  Precise  Language 

1.1  Starting  with  sentences 

We  begin  the  study  of  logic  by  building  a  precise  logical  language.  This  will  allow  us  to  do 
at  least  two  things:  first,  to  say  some  things  more  precisely  than  we  otherwise  would  be  able 
to  do;  second,  to  study  reasoning.  We  will  use  a  natural  language — English — as  our  guide, 
but  our  logical  language  will  be  far  simpler,  far  weaker,  but  more  rigorous  than  English. 

We  must  decide  where  to  start.  We  could  pick  just  about  any  part  of  English  to  try  to 
emulate:  names,  adjectives,  prepositions,  general  nouns,  and  so  on.  But  it  is  traditional, 
and  as  we  will  see,  quite  handy,  to  begin  with  whole  sentences.  For  this  reason,  the  first 
language  we  will  develop  is  called  “the  propositional  logic”.  It  is  also  sometimes  called  “the 
sentential  logic”  or  even  “the  sentential  calculus”.  These  all  mean  the  same  thing:  the  logic 
of  sentences.  In  this  propositional  logic,  the  smallest  independent  parts  of  the  language  are 
sentences  (throughout  this  book,  I  will  assume  that  sentences  and  propositions  are  the  same 
thing  in  our  logic,  and  I  will  use  the  terms  “sentence”  and  “proposition”  interchangeably). 

There  are  of  course  many  kinds  of  sentences.  To  take  examples  from  our  natural  language, 
these  include: 

What  time  is  it? 

Open  the  window. 

Damn  you! 

I  promise  to  pay  you  back. 

It  rained  in  Central  Park  on  June  26,  2015. 

We  could  multiply  such  examples.  Sentences  in  English  can  be  used  to  ask  questions,  give 
commands,  curse  or  insult,  form  contracts,  and  express  emotions.  But,  the  last  example 
above  is  of  special  interest  because  it  aims  to  describe  the  world.  Such  sentences,  which 
are  sometimes  called  “declarative  sentences”,  will  be  our  model  sentences  for  our  logical 
language.  We  know  a  declarative  sentence  when  we  encounter  it  because  it  can  be  either 
true  or  false. 


1.2  Precision  in  sentences 

We  want  our  logic  of  declarative  sentences  to  be  precise.  But  what  does  this  mean?  We 
can  help  clarify  how  we  might  pursue  this  by  looking  at  sentences  in  a  natural  language 
that  are  perplexing,  apparently  because  they  are  not  precise.  Here  are  three. 

Tom  is  kind  of  tall. 

When  Karen  had  a  baby,  her  mother  gave  her  a  pen. 
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This  sentence  is  false. 

We  have  already  observed  that  an  important  feature  of  our  declarative  sentences  is  that  they 
can  be  true  or  false.  We  call  this  the  “truth  value”  of  the  sentence.  These  three  sentences 
are  perplexing  because  their  truth  values  are  unclear.  The  first  sentence  is  vague,  it  is  not 
clear  under  what  conditions  it  would  be  true,  and  under  what  conditions  it  would  be  false. 
If  Tom  is  six  feet  tall,  is  he  kind  of  tall?  There  is  no  clear  answer.  The  second  sentence  is 
ambiguous.  If  “pen”  means  writing  implement,  and  Karen’s  mother  bought  a  playpen  for 
the  baby,  then  the  sentence  is  false.  But  until  we  know  what  “pen”  means  in  this  sentence, 
we  cannot  tell  if  the  sentence  is  true. 

The  third  sentence  is  strange.  Many  logicians  have  spent  many  years  studying  this 
sentence,  which  is  traditionally  called  “the  Liar”.  It  is  related  to  an  old  paradox  about  a 
Cretan  who  said,  “All  Cretans  are  liars”.  The  strange  thing  about  the  Liar  is  that  its  truth 
value  seems  to  explode.  If  it  is  true,  then  it  is  false.  If  it  is  false,  then  it  is  true.  Some 
philosophers  think  this  sentence  is,  therefore,  neither  true  nor  false;  some  philosophers  think 
it  is  both  true  and  false.  In  either  case,  it  is  confusing.  How  could  a  sentence  that  looks  like 
a  declarative  sentence  have  both  or  no  truth  value? 

Since  ancient  times,  philosophers  have  believed  that  we  will  deceive  ourselves,  and  come  to 
believe  untruths,  if  we  do  not  accept  a  principle  sometimes  called  “bivalence”,  or  a  related 
principle  called  “the  principle  of  non-contradiction”.  Bivalence  is  the  view  that  there  are 
only  two  truth  values  (true  and  false)  and  that  they  exclude  each  other.  The  principle  of 
non-contradiction  states  that  you  have  made  a  mistake  if  you  both  assert  and  deny  a  claim. 
One  or  the  other  of  these  principles  seems  to  be  violated  by  the  Liar. 

We  can  take  these  observations  for  our  guide:  we  want  our  language  to  have  no  vagueness 
and  no  ambiguity.  In  our  propositional  logic,  this  means  we  want  it  to  be  the  case  that  each 
sentence  is  either  true  or  false.  It  will  not  be  kind  of  true,  or  partially  true,  or  true  from  one 
perspective  and  not  true  from  another.  We  also  want  to  avoid  things  like  the  Liar.  We  do 
not  need  to  agree  on  whether  the  Liar  is  both  true  and  false,  or  neither  true  nor  false.  Either 
would  be  unfortunate.  So,  we  will  specify  that  our  sentences  have  neither  vice. 

We  can  formulate  our  own  revised  version  of  the  principle  of  bivalence,  which  states  that: 

Principle  of  Bivalence:  Each  sentence  of  our  language  must  be  either  true  or  false, 
not  both,  not  neither. 

This  requirement  may  sound  trivial,  but  in  fact  it  constrains  what  we  do  from  now  on  in 
interesting  and  even  surprising  ways.  Even  as  we  build  more  complex  logical  languages 
later,  this  principle  will  be  fundamental. 

Some  readers  may  be  thinking:  what  if  I  reject  bivalence,  or  the  principle  of  non¬ 
contradiction?  There  is  a  long  line  of  philosophers  who  would  like  to  argue  with  you, 
and  propose  that  either  move  would  be  a  mistake,  and  perhaps  even  incoherent.  Set  those 
arguments  aside.  If  you  have  doubts  about  bivalence,  or  the  principle  of  non-contradiction, 
stick  with  logic.  That  is  because  we  could  develop  a  logic  in  which  there  were  more  than 
two  truth  values.  Logics  have  been  created  and  studied  in  which  we  allow  for  three  truth 
values,  or  continuous  truth  values,  or  stranger  possibilities.  The  issue  for  us  is  that  we  must 
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start  somewhere,  and  the  principle  of  bivalence  is  an  intuitive  way  and — it  would  seem — the 
simplest  way  to  start  with  respect  to  truth  values.  Learn  basic  logic  first,  and  then  you  can 
explore  these  alternatives. 

This  points  us  to  an  important  feature,  and  perhaps  a  mystery,  of  logic.  In  part,  what 
a  logical  language  shows  us  is  the  consequences  of  our  assumptions.  That  might  sound 
trivial,  but,  in  fact,  it  is  anything  but.  From  very  simple  assumptions,  we  will  discover 
new,  and  ultimately  shocking,  facts.  So,  if  someone  wants  to  study  a  logical  language 
where  we  reject  the  principle  of  bivalence,  they  can  do  so.  The  difference  between  what 
they  are  doing,  and  what  we  will  do  in  the  following  chapters,  is  that  they  will  discover 
the  consequences  of  rejecting  the  principle  of  bivalence,  whereas  we  will  discover  the 
consequences  of  adhering  to  it.  In  either  case,  it  would  be  wise  to  learn  traditional  logic 
first,  before  attempting  to  study  or  develop  an  alternative  logic. 

We  should  note  at  this  point  that  we  are  not  going  to  try  to  explain  what  “true”  and 
“false”  mean,  other  than  saying  that  “false”  means  not  true.  When  we  add  something  to 
our  language  without  explaining  its  meaning,  we  call  it  a  “primitive”.  Philosophers  have 
done  much  to  try  to  understand  what  truth  is,  but  it  remains  quite  difficult  to  define  truth 
in  any  way  that  is  not  controversial.  Fortunately,  taking  true  as  a  primitive  will  not  get  us 
into  trouble,  and  it  appears  unlikely  to  make  logic  mysterious.  We  all  have  some  grasp  of 
what  “true”  means,  and  this  grasp  will  be  sufficient  for  our  development  of  the  propositional 
logic. 


1 .3  Atomic  sentences 

Our  language  will  be  concerned  with  declarative  sentences,  sentences  that  are  either  true  or 
false,  never  both,  and  never  neither.  Here  are  some  example  sentences. 


2+2=4. 


Malcolm  Little  is  tall. 

If  Lincoln  wins  the  election,  then  Lincoln  will  be  President. 

The  Earth  is  not  the  center  of  the  universe. 

These  are  all  declarative  sentences.  These  all  appear  to  satisfy  our  principle  of  bivalence. 
But  they  differ  in  important  ways.  The  first  two  sentences  do  not  have  sentences  as  parts. 
For  example,  try  to  break  up  the  first  sentence.  “2+2”  is  a  function.  “4”  is  a  name.  “=4”  is 
a  meaningless  fragment,  as  is  “2+”.  Only  the  whole  expression,  “2+2=4”,  is  a  sentence  with 
a  truth  value.  The  second  sentence  is  similar  in  this  regard.  “Malcolm  Little”  is  a  name, 
“is  tall”  is  an  adjective  phrase  (we  will  discover  later  that  logicians  call  this  a  “predicate”). 
“Malcolm  Little  is”  or  “is  tall”  are  fragments,  they  have  no  truth  value. Only  “Malcolm 
Little  is  tall”  is  a  complete  sentence. 

The  first  two  example  sentences  above  are  of  a  kind  we  call  “atomic  sentences”.  The  word 
“atom”  comes  from  the  ancient  Greek  word  “atomos”,  meaning  cannot  be  cut.  When  the 
ancient  Greeks  reasoned  about  matter,  for  example,  some  of  them  believed  that  if  you  took 
some  substance,  say  a  rock,  and  cut  it  into  pieces,  then  cut  the  pieces  into  pieces,  and  so  on, 
eventually  you  would  get  to  something  that  could  not  be  cut.  This  would  be  the  smallest 
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possible  thing.  (The  fact  that  we  now  talk  of  having  “split  the  atom”  just  goes  to  show 
that  we  changed  the  meaning  of  the  word  “atom”.  We  came  to  use  it  as  a  name  for  a 
particular  kind  of  thing,  which  then  turned  out  to  have  parts,  such  as  electrons,  protons, 
and  neutrons.)  In  logic,  the  idea  of  an  atomic  sentence  is  of  a  sentence  that  can  have  no 
parts  that  are  sentences. 

In  reasoning  about  these  atomic  sentences,  we  could  continue  to  use  English.  But  for 
reasons  that  become  clear  as  we  proceed,  there  are  many  advantages  to  coming  up  with  our 
own  way  of  writing  our  sentences.  It  is  traditional  in  logic  to  use  upper  case  letters  from 
P  on  (P,  Q,  R,  S...  .)  to  stand  for  atomic  sentences.  Thus,  instead  of  writing 

Malcolm  Little  is  tall. 

We  could  write 

P 

If  we  want  to  know  how  to  translate  P  to  English,  we  can  provide  a  translation  key. 
Similarly,  instead  of  writing 

Malcolm  Little  is  a  great  orator. 

We  could  write 

Q 

And  so  on.  Of  course,  written  in  this  way,  all  we  can  see  about  such  a  sentence  is  that  it 
is  a  sentence,  and  that  perhaps  P  and  Q  are  different  sentences.  But  for  now,  these  will  be 
sufficient. 

Note  that  not  all  sentences  are  atomic.  The  third  sentence  in  our  four  examples  above 
contains  parts  that  are  sentences.  It  contains  the  atomic  sentence,  “Lincoln  wins  the 
election”  and  also  the  atomic  sentence,  “Lincoln  will  be  President”.  We  could  represent  this 
whole  sentence  with  a  single  letter.  That  is,  we  could  let 

If  Lincoln  wins  the  election,  Lincoln  will  be  president. 

be  represented  in  our  logical  language  by 

S 

However,  this  would  have  the  disadvantage  that  it  would  hide  some  of  the  sentences  that 
are  inside  this  sentence,  and  also  it  would  hide  their  relationship.  Our  language  would  tell 
us  more  if  we  could  capture  the  relation  between  the  parts  of  this  sentence,  instead  of  hiding 
them.  We  will  do  this  in  chapter  2. 

1 .4  Syntax  and  semantics 

An  important  and  useful  principle  for  understanding  a  language  is  the  difference  between 
syntax  and  semantics.  “Syntax”  refers  to  the  “shape”  of  an  expression  in  our  language.  It 
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does  not  concern  itself  with  what  the  elements  of  the  language  mean,  but  just  specifies  how 
they  can  be  written  out. 

We  can  make  a  similar  distinction  (though  not  exactly  the  same)  in  a  natural  language. 
This  expression  in  English  has  an  uncertain  meaning,  but  it  has  the  right  “shape”  to  be  a 
sentence: 


Colorless  green  ideas  sleep  furiously. 

In  other  words,  in  English,  this  sentence  is  syntactically  correct,  although  it  may  express 
some  kind  of  meaning  error. 

An  expression  made  with  the  parts  of  our  language  must  have  correct  syntax  in  order  for 
it  to  be  a  sentence.  Sometimes,  we  also  call  an  expression  with  the  right  syntactic  form  a 
“well-formed  formula”. 

We  contrast  syntax  with  semantics.  “Semantics”  refers  to  the  meaning  of  an  expression 
of  our  language.  Semantics  depends  upon  the  relation  of  that  element  of  the  language  to 
something  else.  For  example,  the  truth  value  of  the  sentence,  “The  Earth  has  one  moon” 
depends  not  upon  the  English  language,  but  upon  something  exterior  to  the  language. 
Since  the  self-standing  elements  of  our  propositional  logic  are  sentences,  and  the  most 
important  property  of  these  is  their  truth  value,  the  only  semantic  feature  of  sentences  that 
will  concern  us  in  our  propositional  logic  is  their  truth  value. 

Whenever  we  introduce  a  new  element  into  the  propositional  logic,  we  will  specify  its 
syntax  and  its  semantics.  In  the  propositional  logic,  the  syntax  is  generally  trivial,  but  the 
semantics  is  less  so.  We  have  so  far  introduced  atomic  sentences.  The  syntax  for  an  atomic 
sentence  is  trivial.  If  P  is  an  atomic  sentence,  then  it  is  syntactically  correct  to  write  down 

P 

By  saying  that  this  is  syntactically  correct,  we  are  not  saying  that  P  is  true.  Rather,  we  are 
saying  that  P  is  a  sentence. 

If  semantics  in  the  propositional  logic  concerns  only  truth  value,  then  we  know  that  there 
are  only  two  possible  semantic  values  for  P;  it  can  be  either  true  or  false.  We  have  a  way  of 
writing  this  that  will  later  prove  helpful.  It  is  called  a  “truth  table”.  For  an  atomic  sentence, 
the  truth  table  is  trivial,  but  when  we  look  at  other  kinds  of  sentences  their  truth  tables  will 
be  more  complex. 

The  idea  of  a  truth  table  is  to  describe  the  conditions  in  which  a  sentence  is  true  or  false. 
We  do  this  by  identifying  all  the  atomic  sentences  that  compose  that  sentence.  Then,  on 
the  left  side,  we  stipulate  all  the  possible  truth  values  of  these  atomic  sentences  and  write 
these  out.  On  the  right  side,  we  then  identify  under  what  conditions  the  sentence  (that  is 
composed  of  the  other  atomic  sentences)  is  true  or  false. 

The  idea  is  that  the  sentence  on  the  right  is  dependent  on  the  sentence(s)  on  the  left.  So  the 
truth  table  is  filled  in  like  this: 
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Atomic  sentence(s)  that  compose 
the  dependent  sentence  on  the  right 

Dependent  sentence  composed  of  the  atomic 
sentences  on  the  left 

All  possible  combinations  of  truth 
values  of  the  composing  atomic 
sentences 

Resulting  truth  values  for  each  possible 
combination  of  truth  values  of  the  composing 
atomic  sentences 

We  stipulate  all  the  possible  truth  values  on  the  bottom  left  because  the  propositional  logic 
alone  will  not  determine  whether  an  atomic  sentence  is  true  or  false;  thus,  we  will  simply 
have  to  consider  both  possibilities.  Note  that  there  are  many  ways  that  an  atomic  sentence 
can  be  true,  and  there  are  many  ways  that  it  can  be  false.  For  example,  the  sentence,  “Tom 
is  American”  might  be  true  if  Tom  was  born  in  New  York,  in  Texas,  in  Ohio,  and  so  on. 
The  sentence  might  be  false  because  Tom  was  born  to  Italian  parents  in  Italy,  to  French 
parents  in  France,  and  so  on.  So,  we  group  all  these  cases  together  into  two  kinds  of  cases. 

These  are  two  rows  of  the  truth  table  for  an  atomic  sentence.  Each  row  of  the  truth  table 
represents  a  kind  of  way  that  the  world  could  be.  So  here  is  the  left  side  of  a  truth  table 
with  only  a  single  atomic  sentence,  P.  We  will  write  “7”’  for  true  and  “F”  for  false. 


There  are  only  two  relevant  kinds  of  ways  that  the  world  can  be,  when  we  are  considering 
the  semantics  of  an  atomic  sentence.  The  world  can  be  one  of  the  many  conditions  such 
that  P  is  true,  or  it  can  be  one  of  the  many  conditions  such  that  P  is  false. 

To  complete  the  truth  table,  we  place  the  dependent  sentence  on  the  top  right  side,  and 
describe  its  truth  value  in  relation  to  the  truth  value  of  its  parts.  We  want  to  identify  the 
semantics  of  P,  which  has  only  one  part,  P.  The  truth  table  thus  has  the  final  form: 


This  truth  table  tells  us  the  meaning  of  P,  as  far  as  our  propositional  logic  can  tell  us  about  it. 
Thus,  it  gives  us  the  complete  semantics  for  P.  (As  we  will  see  later,  truth  tables  have  three 
uses:  to  provide  the  semantics  for  a  kind  of  sentence;  to  determine  under  what  conditions 
a  complex  sentence  is  true  or  false;  and  to  determine  if  an  argument  is  good.  Here  we  are 
describing  only  this  first  use.) 

In  this  truth  table,  the  first  row  combined  together  all  the  kinds  of  ways  the  world  could 
be  in  which  P  is  true.  In  the  second  column  we  see  that  for  all  of  these  kinds  of  ways  the 
world  could  be  in  which  P  is  true,  unsurprisingly,  P  is  true.  The  second  row  combines 
together  all  the  kinds  of  ways  the  world  could  be  in  which  P  is  false.  In  those,  P  is  false.  As 
we  noted  above,  in  the  case  of  an  atomic  sentence,  the  truth  table  is  trivial.  Nonetheless,  the 
basic  concept  is  very  useful,  as  we  will  begin  to  see  in  the  next  chapter. 
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One  last  tool  will  be  helpful  to  us.  Strictly  speaking,  what  we  have  done  above  is  give 
the  syntax  and  semantics  for  a  particular  atomic  sentence,  P.  We  need  a  way  to  make 
general  claims  about  all  the  sentences  of  our  language,  and  then  give  the  syntax  and 
semantics  for  any  atomic  sentences.  We  do  this  using  variables,  and  here  we  will  use  Greek 
letters  for  those  variables,  such  as  <t>  and  VF.  Things  said  using  these  variables  is  called  our 
“metalanguage”  which  means  literally  the  after  language,  but  which  we  take  to  mean,  our 
language  about  our  language.  The  particular  propositional  logic  that  we  create  is  called  our 
“object  language”.  P  and  Q  are  sentences  of  our  object  language.  $  and  W  are  elements 
of  our  metalanguage.  To  specify  now  the  syntax  of  atomic  sentences  (that  is,  of  all  atomic 
sentences)  we  can  say:  If  $  is  an  atomic  sentence,  then 

$ 

is  a  sentence.  This  tells  us  that  simply  writing  down  (whatever  atomic  sentence  it  may 
be),  as  we  have  just  done,  is  to  write  down  something  that  is  syntactically  correct. 

To  specify  now  the  semantics  of  atomic  sentences  (that  is,  of  all  atomic  sentences)  we  can 
say:  If  $>  is  an  atomic  sentence,  then  the  semantics  of  is  given  by 


<t> 

<5 

T 

T 

F 

F 

Note  an  important  and  subtle  point.  The  atomic  sentences  of  our  propositional  logic  will 
be  what  we  call  “contingent”  sentences.  A  contingent  sentence  can  be  either  true  or  false. 
We  will  see  later  that  some  complex  sentences  of  our  propositional  logic  must  be  true,  and 
some  complex  sentences  of  our  propositional  logic  must  be  false.  But  for  the  propositional 
logic,  every  atomic  sentence  is  (as  far  as  we  can  tell  using  the  propositional  logic  alone) 
contingent.  This  observation  matters  because  it  greatly  helps  to  clarify  where  logic  begins, 
and  where  the  methods  of  another  discipline  ends.  For  example,  suppose  we  have  an  atomic 
sentence  like: 

Force  is  equal  to  mass  times  acceleration. 

Igneous  rocks  formed  under  pressure. 

Germany  inflated  its  currency  in  1923  in  order  to  reduce  its  reparations  debt. 

Logic  cannot  tell  us  whether  these  are  true  or  false.  We  will  turn  to  physicists,  and  use  their 
methods,  to  evaluate  the  first  claim.  We  will  turn  to  geologists,  and  use  their  methods,  to 
evaluate  the  second  claim.  We  will  turn  to  historians,  and  use  their  methods,  to  evaluate 
the  third  claim.  But  the  logician  can  tell  the  physicist,  geologist,  and  historian  what  follows 
from  their  claims. 


1 .5  Problems 

1.  Vagueness  arises  when  the  conditions  under  which  a  sentence  might  be  true  are 

“fuzzy”.  That  is,  in  some  cases,  we  cannot  identify  if  the  sentence  is  true  or  false.  If  we 
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say,  “Tom  is  tall”,  this  sentence  is  certainly  true  if  Tom  is  the  tallest  person  in  the 
world,  but  it  is  not  clear  whether  it  is  true  if  Tom  is  185  centimeters  tall.  Identify  or 
create  five  declarative  sentences  in  English  that  are  vague. 

2.  Ambiguity  usually  arises  when  a  word  or  phrase  has  several  distinct  possible 
interpretations.  In  our  example  above,  the  word  “pen”  could  mean  either  a  writing 
implement  or  a  structure  to  hold  a  child.  A  sentence  that  includes  “pen”  could  be 
ambiguous,  in  which  case  it  might  be  true  for  one  interpretation  and  false  for  another. 

Identify  or  create  five  declarative  sentences  in  English  that  are  ambiguous.  (This  will 
probably  require  you  to  identify  a  homonym,  a  word  that  has  more  than  one  meaning 
but  sounds  or  is  written  the  same.  If  you  are  stumped,  consider  slang:  many  slang 
terms  are  ambiguous  because  they  redefine  existing  words.  For  example,  in  the  1980s, 
in  some  communities  and  contexts,  to  say  something  was  “bad”  meant  that  it  was  good; 
this  obviously  can  create  ambiguous  sentences.) 

3.  Often  we  can  make  a  vague  sentence  precise  by  defining  a  specific  interpretation  of  the 
meaning  of  an  adjective,  term,  or  other  element  of  the  language.  For  example,  we 
could  make  the  sentence  “Tom  is  tall”  precise  by  specifying  one  person  referred  to  by 
“Tom”,  and  also  by  defining  “..  .is  tall”  as  true  of  anyone  180  centimeters  tall  or  taller. 

For  each  of  the  five  vague  sentences  that  you  identified  or  created  for  problem  1, 
describe  how  the  interpretation  of  certain  elements  of  the  sentence  could  make  the 
sentence  no  longer  vague. 

4.  Often  we  can  make  an  ambiguous  sentence  precise  by  specifying  which  of  the  possible 
meanings  we  intend  to  use.  We  could  make  the  sentence,  “Tom  is  by  the  pen” 
unambiguous  by  specifying  which  Tom  we  mean,  and  also  defining  “pen”  to  mean  an 
infant  play  pen.  For  each  of  the  five  ambiguous  sentences  that  you  identified  or  created 
for  problem  2,  identify  and  describe  how  the  interpretation  of  certain  elements  of  the 
sentence  could  make  the  sentence  no  longer  ambiguous. 

5.  Come  up  with  five  examples  of  your  own  of  English  sentences  that  are  not  declarative 
sentences.  (Examples  can  include  commands,  exclamations,  and  promises.) 


[2]  There  is  a  complex  issue  here  that  we  will  discuss  later.  But,  in  brief:  “is”  is  ambiguous; 
it  has  several  meanings.  “Malcolm  Tittle  is”  is  a  sentence  if  it  is  meant  to  assert  the  existence 
of  Malcolm  Tittle.  The  “is”  that  appears  in  the  sentence,  “Malcolm  Tittle  is  tall”,  however, 
is  what  we  call  the  “‘is’  of  predication”.  In  that  sentence,  “is”  is  used  to  assert  that  a  property 
is  had  by  Malcolm  Tittle  (the  property  of  being  tall);  and  here  “is  tall”  is  what  we  are  calling 
a  “predicate”.  So,  the  “is”  of  predication  has  no  clear  meaning  when  appearing  without  the 
rest  of  the  predicate;  it  does  not  assert  existence. 


2.  "If.. .then...."  and  "It  is  not  the  case 

that...." 


2.1  The  Conditional 

As  we  noted  in  chapter  1,  there  are  sentences  of  a  natural  language,  like  English,  that  are 
not  atomic  sentences.  Our  examples  included 

If  Lincoln  wins  the  election,  then  Lincoln  will  be  President. 

The  Earth  is  not  the  center  of  the  universe. 

We  could  treat  these  like  atomic  sentences,  but  then  we  would  lose  a  great  deal  of  important 
information.  Lor  example,  the  first  sentence  tells  us  something  about  the  relationship 
between  the  atomic  sentences  “Lincoln  wins  the  election”  and  “Lincoln  will  be  President”. 
And  the  second  sentence  above  will,  one  supposes,  have  an  interesting  relationship  to  the 
sentence,  “The  Earth  is  the  center  of  the  universe”.  To  make  these  relations  explicit,  we 
will  have  to  understand  what  “if. ..then...”  and  “not”  mean.  Thus,  it  would  be  useful  if 
our  logical  language  was  able  to  express  these  kinds  of  sentences  in  a  way  that  made  these 
elements  explicit.  Let  us  start  with  the  first  one. 

The  sentence,  “If  Lincoln  wins  the  election,  then  Lincoln  will  be  President”  contains  two 
atomic  sentences,  “Lincoln  wins  the  election”  and  “Lincoln  will  be  President”.  We  could 
thus  represent  this  sentence  by  letting 

Lincoln  wins  the  election 

be  represented  in  our  logical  language  by 

P 

And  by  letting 

Lincoln  will  be  president 
be  represented  by 

Q 

Then,  the  whole  expression  could  be  represented  by  writing 
If  P  then  Q 

It  will  be  useful,  however,  to  replace  the  English  phrase  “if. .  .then. . .”  by  a  single  symbol  in 
our  language.  The  most  commonly  used  such  symbol  is  Thus,  we  would  write 
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P-Q 

One  last  thing  needs  to  be  observed,  however.  We  might  want  to  combine  this  complex 
sentence  with  other  sentences.  In  that  case,  we  need  a  way  to  identify  that  this  is  a  single 
sentence  when  it  is  combined  with  other  sentences.  There  are  several  ways  to  do  this,  but 
the  most  familiar  (although  not  the  most  elegant)  is  to  use  parentheses.  Thus,  we  will  write 
our  expression 

(P-Q) 

This  kind  of  sentence  is  called  a  “conditional”.  It  is  also  sometimes  called  a  “material 
conditional”.  The  first  constituent  sentence  (the  one  before  the  arrow,  which  in  this 
example  is  “P”)  is  called  the  “antecedent”.  The  second  sentence  (the  one  after  the  arrow, 
which  in  this  example  is  “Q”)  is  called  the  “consequent”. 

We  know  how  to  write  the  conditional,  but  what  does  it  mean?  As  before,  we  will  take  the 
meaning  to  be  given  by  the  truth  conditions — that  is,  a  description  of  when  the  sentence 
is  either  true  or  false.  We  do  this  with  a  truth  table.  But  now,  our  sentence  has  two  parts 
that  are  atomic  sentences,  P  and  Q.  Note  that  either  atomic  sentence  could  be  true  or  false. 
That  means,  we  have  to  consider  four  possible  kinds  of  situations.  We  must  consider  when 
P  is  true  and  when  it  is  false,  but  then  we  need  to  consider  those  two  kinds  of  situations 
twice:  once  for  when  Q  is  true  and  once  for  when  Q  is  false.  Thus,  the  left  hand  side  of 
our  truth  table  will  look  like  this: 


There  are  four  kinds  of  ways  the  world  could  be  that  we  must  consider. 

Note  that,  since  there  are  two  possible  truth  values  (true  and  false),  whenever  we  consider 
another  atomic  sentence,  there  are  twice  as  many  ways  the  world  could  be  that  we  should 

consider.  Thus,  for  n  atomic  sentences,  our  truth  table  must  have  2n  rows.  In  the  case  of 
a  conditional  formed  out  of  two  atomic  sentences,  like  our  example  of  (P^Q),  our  truth 
table  will  have  2?  rows,  which  is  4  rows.  We  see  this  is  the  case  above. 

Now,  we  must  decide  upon  what  the  conditional  means.  To  some  degree  this  is  up  to 
us.  What  matters  is  that  once  we  define  the  semantics  of  the  conditional,  we  stick  to  our 
definition.  But  we  want  to  capture  as  much  of  the  meaning  of  the  English  “if. .  .then. . as 
we  can,  while  remaining  absolutely  precise  in  our  language. 

Let  us  consider  each  kind  of  way  the  world  could  be.  For  the  first  row  of  the  truth  table, 
we  have  that  P  is  true  and  Q  is  true.  Suppose  the  world  is  such  that  Lincoln  wins  the 
election,  and  also  Lincoln  will  be  President.  Then,  would  I  have  spoken  truly  if  I  said,  “If 
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Lincoln  wins  the  election,  then  Lincoln  will  be  President”?  Most  people  agree  that  I  would 
have.  Similarly,  suppose  that  Lincoln  wins  the  election,  but  Lincoln  will  not  be  President. 
Would  the  sentence  “If  Lincoln  wins  the  election,  then  Lincoln  will  be  President”  still 
be  true?  Most  agree  that  it  would  be  false  now.  So  the  first  rows  of  our  truth  table  are 
uncontroversial. 


p 

£ 

(P-Q) 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

Some  students,  however,  find  it  hard  to  determine  what  truth  values  should  go  in  the  next 
two  rows.  Note  now  that  our  principle  of  bivalence  requires  us  to  fill  in  these  rows.  We 
cannot  leave  them  blank.  If  we  did,  we  would  be  saying  that  sometimes  a  conditional  can 
have  no  truth  value;  that  is,  we  would  be  saying  that  sometimes,  some  sentences  have  no 
truth  value.  But  our  principle  of  bivalence  requires  that — in  all  kinds  of  situations — every 
sentence  is  either  true  or  false,  never  both,  never  neither.  So,  if  we  are  going  to  respect  the 
principle  of  bivalence,  then  we  have  to  put  either  T  or  F  in  for  each  of  the  last  two  rows. 

It  is  helpful  at  this  point  to  change  our  example.  Let  us  consider  two  different  examples  to 
illustrate  how  best  to  fill  out  the  remainder  of  the  truth  table  for  the  conditional. 

First,  suppose  I  say  the  following  to  you:  “If  you  give  me  $50,  then  I  will  buy  you  a  ticket 
to  the  concert  tonight.”  Let 

You  give  me  $50 

be  represented  in  our  logic  by 

R 


and  let 


I  will  buy  you  a  ticket  to  the  concert  tonight, 
be  represented  by 

S 

Our  sentence  then  is 

(R-S) 


And  its  truth  table — as  far  as  we  understand  right  now — is: 
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R 

S 

(R-S) 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

That  is,  if  you  give  me  the  money  and  I  buy  you  the  ticket,  my  claim  that  “If  you  give  me 
$50,  then  I  will  buy  you  a  ticket  to  the  concert  tonight”  is  true.  And,  if  you  give  me  the 
money  and  I  don’t  buy  you  the  ticket,  I  lied,  and  my  claim  is  false.  But  now,  suppose  you 
do  not  give  me  $50,  but  I  buy  you  a  ticket  for  the  concert  as  a  gift.  Was  my  claim  false? 
No.  I  simply  bought  you  the  ticket  as  a  gift,  but,  presumably  would  have  bought  it  if  you 
gave  me  the  money,  also.  Similarly,  if  you  don’t  give  me  money,  and  I  do  not  buy  you  a 
ticket,  that  seems  perfectly  consistent  with  my  claim. 

So,  the  best  way  to  fill  out  the  truth  table  is  as  follows. 


R 

S 

(R-S) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

Second,  consider  another  sentence,  which  has  the  advantage  that  it  is  very  clear  with  respect 
to  these  last  two  rows.  Assume  that  a  is  a  particular  natural  number,  only  you  and  I  don’t 
know  what  number  it  is  (the  natural  numbers  are  the  whole  positive  numbers:  1,  2,  3,  4. . .). 
Consider  now  the  following  sentence. 

If  a  is  evenly  divisible  by  4,  then  a  is  evenly  divisible  by  2. 

(By  “evenly  divisible,”  I  mean  divisible  without  remainder.)  The  first  thing  to  ask  yourself 
is:  is  this  sentence  true?  I  hope  we  can  all  agree  that  it  is — even  though  we  do  not  know 
what  a  is.  Let 

a  is  evenly  divisible  by  4 

be  represented  in  our  logic  by 

U 


and  let 


a  is  evenly  divisible  by  2 


be  represented  by 
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V 

Our  sentence  then  is 

(U-V) 

And  its  truth  table — as  far  as  we  understand  right  now — is: 


u 

V 

(u-v) 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

Now  consider  a  case  in  which  a  is  6.  This  is  like  the  third  row  of  the  truth  table.  It  is  not 
the  case  that  6  is  evenly  divisible  by  4,  but  it  is  the  case  that  6  is  evenly  divisible  by  2.  And 
consider  the  case  in  which  a  is  7.  This  is  like  the  fourth  row  of  the  truth  table;  7  would  be 
evenly  divisible  by  neither  4  nor  2.  But  we  agreed  that  the  conditional  is  true — regardless 
of  the  value  of  al  So,  the  truth  table  must  bed3-1 


U 

V 

(u-v) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

Following  this  pattern,  we  should  also  fill  out  our  table  about  the  election  with: 


P 

Q 

(P-Q) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

If  you  are  dissatisfied  by  this,  it  might  be  helpful  to  think  of  these  last  two  rows  as  vacuous 
cases.  A  conditional  tells  us  about  what  happens  if  the  antecedent  is  true.  But  when  the 
antecedent  is  false,  we  simply  default  to  true. 

We  are  now  ready  to  offer,  in  a  more  formal  way,  the  syntax  and  semantics  for  the 
conditional. 

The  syntax  of  the  conditional  is  that,  if  and  VF  are  sentences,  then 


20  A  Concise  Introduction  to  Logic 


(fc-'P) 

is  a  sentence. 

The  semantics  of  the  conditional  are  given  by  a  truth  table.  For  any  sentences  $  and  W: 


<t> 

W 

(< 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

Remember  that  this  truth  table  is  now  a  definition.  It  defines  the  meaning  of  We  are 
agreeing  to  use  the  symbol  to  mean  this  from  here  on  out. 

The  elements  of  the  propositional  logic,  like  that  we  add  to  our  language  in  order  to 
form  more  complex  sentences,  are  called  “truth  functional  connectives”.  I  hope  it  is  clear 
why:  the  meaning  of  this  symbol  is  given  in  a  truth  function.  (If  you  are  unfamiliar  or 
uncertain  about  the  idea  of  a  function,  think  of  a  function  as  like  a  machine  that  takes  in  one 
or  more  inputs,  and  always  then  gives  exactly  one  output.  For  the  conditional,  the  inputs 
are  two  truth  values;  and  the  output  is  one  truth  value.  For  example,  put  T  F  into  the  truth 
function  called  and  you  get  out  F .) 

2.2  Alternative  phrasings  in  English  for  the  conditional.  Only  if. 

English  includes  many  alternative  phrasings  that  appear  to  be  equivalent  to  the  conditional. 
Furthermore,  in  English  and  other  natural  languages,  the  order  of  the  conditional  will 
sometimes  be  reversed.  We  can  capture  the  general  sense  of  these  cases  by  recognizing  that 
each  of  the  following  phrasings  would  be  translated  as  (P->Q).  (In  these  examples,  we  mix 
English  and  our  propositional  logic,  in  order  to  illustrate  the  variations  succinctly.) 

If  P,  then  Q. 

Q,  ifP. 

On  the  condition  that  P,  Q. 

Q,  on  the  condition  that  P. 

Given  that  P,  Q. 

Q,  given  that  P. 

Provided  that  P,  Q. 

Q,  provided  that  P. 
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When  P,  then  Q. 

Q,  when  P. 

P  implies  Q. 

Q  is  implied  by  P. 

P  is  sufficient  for  Q. 

Q  is  necessary  for  P. 

An  oddity  of  English  is  that  the  word  “only”  changes  the  meaning  of  “if”.  You  can  see  this 
if  you  consider  the  following  two  sentences. 

Fifi  is  a  cat,  if  Fifi  is  a  mammal. 

Fifi  is  a  cat  only  if  Fifi  is  a  mammal. 

Suppose  we  know  Fifi  is  an  organism,  but,  we  don’t  know  what  kind  of  organism  Fifi  is. 
Fifi  could  be  a  dog,  a  cat,  a  gray  whale,  a  ladybug,  a  sponge.  It  seems  clear  that  the  first 
sentence  is  not  necessarily  true.  If  Fifi  is  a  gray  whale,  for  example,  then  it  is  true  that 
Fifi  is  a  mammal,  but  false  that  Fifi  is  a  cat;  and  so,  the  first  sentence  would  be  false.  But 
the  second  sentence  looks  like  it  must  be  true  (given  what  you  and  I  know  about  cats  and 
mammals). 

We  should  thus  be  careful  to  recognize  that  “only  if”  does  not  mean  the  same  thing  as  “if”. 
(If  it  did,  these  two  sentences  would  have  the  same  truth  value  in  all  situations.)  In  fact, 
it  seems  that  “only  if”  can  best  be  expressed  by  a  conditional  where  the  “only  if”  appears 
before  the  consequent  (remember,  the  consequent  is  the  second  part  of  the  conditional — the 
part  that  the  arrows  points  at).  Thus,  sentences  of  this  form: 

P  only  if  Q. 

Only  if  Q,  P. 

are  best  expressed  by  the  formula 

(P-Q) 


2.3  Test  your  understanding  of  the  conditional 

People  sometimes  find  conditionals  confusing.  In  part,  this  seems  to  be  because  some 
people  confuse  them  with  another  kind  of  truth-functional  connective,  which  we  will  leam 
about  later,  called  the  “biconditional”.  Also,  sometimes  “if. .  .then. . .”  is  used  in  English  in  a 
different  way  (see  section  17.7  if  you  are  curious  about  alternative  possible  meanings).  But 
from  now  on,  we  will  understand  the  conditional  as  described  above.  To  test  whether  you 
have  properly  grasped  the  conditional,  consider  the  following  puzzled4-1 

We  have  a  set  of  four  cards  in  figure  2.1.  Each  card  has  the  following  property:  it  has  a 
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shape  on  one  side,  and  a  letter  on  the  other  side.  We  shuffle  and  mix  the  cards,  flipping 
some  over  while  we  shuffle.  Then,  we  lay  out  the  four  cards: 


Card  1  Card  2  Card  3  Card  4 


r  ■  q  ★ 


Figure  2.1 

Given  our  constraint  that  each  card  has  a  letter  on  one  side  and  a  shape  on  the  other,  we 
know  that  card  1  has  a  shape  on  the  unseen  side;  card  2  has  a  letter  on  the  unseen  side;  and 
so  on. 

Consider  now  the  following  claim: 

For  each  of  these  four  cards,  if  the  card  has  a  Q  on  the  letter  side  of  the  card,  then 
it  has  a  square  on  the  shape  side  of  the  card. 

Here  is  our  puzzle:  what  is  the  minimum  number  of  cards  that  we  must  turn  over  to  test 
whether  this  claim  is  true  of  all  four  cards;  and  which  cards  are  they  that  we  must  turn  over? 
Of  course  we  could  turn  them  all  over,  but  the  puzzle  asks  you  to  identify  all  and  only  the 
cards  that  will  test  the  claim. 

Stop  reading  now,  and  see  if  you  can  decide  on  the  answer.  Be  warned,  people  generally 
perform  poorly  on  this  puzzle.  Think  about  it  for  a  while.  The  answer  is  given  below  in 
problem  1. 


2.4  Alternative  symbolizations  for  the  conditional 

Some  logic  books,  and  some  logicians,  use  alternative  symbolizations  for  the  various  truth- 
functional  connectives.  The  meanings  (that  is,  the  truth  tables)  are  always  the  same,  but  the 
symbol  used  may  be  different.  For  this  reason,  we  will  take  the  time  in  this  text  to  briefly 
recognize  alternative  symbolizations. 

The  conditional  is  sometimes  represented  with  the  following  symbol:  “d”.  Thus,  in  such  a 
case,  (P->Q)  would  be  written 


(PdQ) 
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2.5  Negation 

In  chapter  1,  we  considered  as  an  example  the  sentence, 

The  Earth  is  not  the  center  of  the  universe. 

At  first  glance,  such  a  sentence  might  appear  to  be  fundamentally  unlike  a  conditional. 
It  does  not  contain  two  sentences,  but  only  one.  There  is  a  “not”  in  the  sentence,  but 
it  is  not  connecting  two  sentences.  However,  we  can  still  think  of  this  sentence  as  being 
constructed  with  a  truth  functional  connective,  if  we  are  willing  to  accept  that  this  sentence 
is  equivalent  to  the  following  sentence. 

It  is  not  the  case  that  the  Earth  is  the  center  of  the  universe. 

If  this  sentence  is  equivalent  to  the  one  above,  then  we  can  treat  “It  is  not  the  case”  as  a  truth 
functional  connective.  It  is  traditional  to  replace  this  cumbersome  English  phrase  with  a 
single  symbol,  “-i”.  Then,  mixing  our  propositional  logic  with  English,  we  would  have 

iThe  Earth  is  the  center  of  the  universe. 

And  if  we  let  W  be  a  sentence  in  our  language  that  has  the  meaning  The  Earth  is  the  center 
of  the  universe,  we  would  write 

AW 

This  connective  is  called  “negation”.  Its  syntax  is:  if  $  is  a  sentence,  then 

is  a  sentence.  We  call  such  a  sentence  a  “negation  sentence”. 

The  semantics  of  a  negation  sentence  is  also  obvious,  and  is  given  by  the  following  truth 
table. 


<t> 

T 

F 

F 

T 

To  deny  a  true  sentence  is  to  speak  a  falsehood.  To  deny  a  false  sentence  is  to  say  something 
true. 

Our  syntax  always  is  recursive.  This  means  that  syntactic  rules  can  be  applied  repeatedly,  to 
the  product  of  the  rule.  In  other  words,  our  syntax  tells  us  that  if  P  is  a  sentence,  then  -iP  is 
a  sentence.  But  now  note  that  the  same  rule  applies  again:  if  -iP  is  a  sentence,  then  -i->P  is 
a  sentence.  And  so  on.  Similarly,  if  P  and  Q  are  sentences,  the  syntax  for  the  conditional 
tells  us  that  (P-»Q)  is  a  sentence.  But  then  so  is  -(P-Q).  and  so  is  HP-Q)  -»  (P-Q)). 
And  so  on.  If  we  have  just  a  single  atomic  sentence,  our  recursive  syntax  will  allow  us  to 
form  infinitely  many  different  sentences  with  negation  and  the  conditional. 
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2.6  Alternative  symbolizations  for  negation 

Some  texts  may  use  for  negation.  Thus,  -iP  would  be  expressed  with 

~P 

2.7  Problems 

1.  The  answer  to  our  card  game  was:  you  need  only  turn  over  cards  3  and  4.  This  might 
seem  confusing  to  many  people  at  first.  But  remember  the  meaning  of  the  conditional: 
it  can  only  be  false  if  the  first  part  is  true  and  the  second  part  is  false.  The  sentence  we 
want  to  test  is  “For  each  of  these  four  cards,  if  the  card  has  a  Q  on  the  letter  side  of  the 
card,  then  it  has  a  square  on  the  shape  side  of  the  card”.  Let  Q  stand  for  “the  card  has  a 
Q  on  the  letter  side  of  the  card.”  Let  S  stand  for  “the  card  has  a  square  on  the  shape 
side  of  the  card.”  Then  we  could  make  a  truth  table  to  express  the  meaning  of  the 
claim  being  tested: 


Q 

S 

(Q-s) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

Look  back  at  the  cards.  The  first  card  has  an  R  on  the  letter  side.  So,  sentence  Q  is  false. 
But  then  we  are  in  a  situation  like  the  last  two  rows  of  the  truth  table,  and  the  conditional 
cannot  be  false.  We  do  not  need  to  check  that  card.  The  second  card  has  a  square  on  it. 
That  means  S  is  true  for  that  card.  But  then  we  are  in  a  situation  represented  by  either  the 
first  or  third  row  of  the  truth  table.  Again,  the  claim  that  (Q^S)  cannot  be  false  in  either 
case  with  respect  to  that  card,  so  there  is  no  point  in  checking  that  card.  The  third  card 
shows  a  Q.  It  corresponds  to  a  situation  that  is  like  either  the  first  or  second  row  of  the  truth 
table.  We  cannot  tell  then  whether  (Q->S)  is  true  or  false  of  that  card,  without  turning  the 
card  over.  Similarly,  the  last  card  shows  a  situation  where  S  is  false,  so  we  are  in  a  kind  of 
situation  represented  by  either  the  second  or  last  row  of  the  truth  table.  We  must  turn  the 
card  over  to  determine  if  (Q->S)  is  true  or  false  of  that  card. 

Try  this  puzzle  again.  Consider  the  following  claim  about  those  same  four  cards:  If  there 
is  a  star  on  the  shape  side  of  the  card,  then  there  is  an  R  on  the  letter  side  of  the  card.  What 
is  the  minimum  number  of  cards  that  you  must  turn  over  to  check  this  claim?  What  cards 
are  they? 

2.  Consider  the  following  four  cards  in  figure  2.2.  Each  card  has  a  letter  on  one  side,  and 
a  shape  on  the  other  side. 

For  each  of  the  following  claims,  in  order  to  determine  if  the  claim  is  true  of  all  four  cards, 
describe  (l)  The  minimum  number  of  cards  you  must  turn  over  to  check  the  claim,  and  (2) 
what  those  cards  are. 

a.  There  is  not  a  Q  on  the  letter  side  of  the  card. 
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Card  1  Card  2  Card  3  Card  4 

p  ▲  •  ♦ 


Figure  2.2 

b.  There  is  not  an  octagon  on  the  shape  side  of  the  card. 

c.  If  there  is  a  triangle  on  the  shape  side  of  the  card,  then  there  is  a  P  on  the  letter  side  of 
the  card. 

d.  There  is  an  R  on  the  letter  side  of  the  card  only  if  there  is  a  diamond  on  the  shape  side 
of  the  card. 

e.  There  is  a  hexagon  on  the  shape  side  of  the  card,  on  the  condition  that  there  is  a  P  on 
the  letter  side  of  the  card. 

f.  There  is  a  diamond  on  the  shape  side  of  the  card  only  if  there  is  a  P  on  the  letter  side 
of  the  card. 

3.  Which  of  the  following  have  correct  syntax?  Which  have  incorrect  syntax? 

a.  P->Q 

b.  ^(P-Q) 

c.  CP-Q) 

d.  (P-i-»Q) 

e.  (P-»->Q) 

4.  Use  the  following  translation  key  to  translate  the  following  sentences  into  a  propositional 
logic. 


Translation  Key 

Logic 

English 

P 

Abe  is  able. 

Q 

Abe  is  honest. 

a.  If  Abe  is  honest,  Abe  is  able. 
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b.  Abe  is  not  able. 

c.  Abe  is  not  able  only  if  Abe  is  not  honest. 

d.  Abe  is  able,  provided  that  Abe  is  not  honest. 

e.  If  Abe  is  not  able  then  Abe  is  not  honest. 

5.  Make  up  your  own  translation  key  to  translate  the  following  sentences  into  a 
propositional  logic.  Then,  use  your  key  to  translate  the  sentences  into  the  propositional 
logic.  Your  translation  key  should  contain  only  atomic  sentences.  These  should  be  all  and 
only  the  atomic  sentences  needed  to  translate  the  following  sentences  of  English.  Don’t  let 
it  bother  you  that  some  of  the  sentences  must  be  false. 

a.  Josie  is  a  cat. 

b.  Josie  is  a  mammal. 

c.  Josie  is  not  a  mammal. 

d.  If  Josie  is  not  a  cat,  then  Josie  is  not  a  mammal. 

e.  Josie  is  a  fish. 

f.  Provided  that  Josie  is  a  mammal,  Josie  is  not  a  fish. 

g.  Josie  is  a  cat  only  if  Josie  is  a  mammal. 

h.  Josie  is  a  fish  only  if  Josie  is  not  a  mammal. 

i.  It’s  not  the  case  that  Josie  is  not  a  mammal. 

j.  Josie  is  not  a  cat,  if  Josie  is  a  fish. 

6.  This  problem  will  make  use  of  the  principle  that  our  syntax  is  recursive.  Translating 
these  sentences  is  more  challenging.  Make  up  your  own  translation  key  to  translate  the 
following  sentences  into  a  propositional  logic.  Your  translation  key  should  contain  only 
atomic  sentences;  these  should  be  all  and  only  the  atomic  sentences  needed  to  translate  the 
following  sentences  of  English. 

a.  It  is  not  the  case  that  Tom  won’t  pass  the  exam. 

b.  If  Tom  studies,  Tom  will  pass  the  exam. 

c.  It  is  not  the  case  that  if  Tom  studies,  then  Tom  will  pass  the  exam. 

d.  If  Tom  does  not  study,  then  Tom  will  not  pass  the  exam. 

e.  If  Tom  studies,  Tom  will  pass  the  exam — provided  that  he  wakes  in  time. 

f.  If  Tom  passes  the  exam,  then  if  Steve  studies,  Steve  will  pass  the  exam. 

g.  It  is  not  the  case  that  if  Tom  passes  the  exam,  then  if  Steve  studies,  Steve  will  pass  the 
exam. 

h.  If  Tom  does  not  pass  the  exam,  then  if  Steve  studies,  Steve  will  pass  the  exam. 

i.  If  Tom  does  not  pass  the  exam,  then  it  is  not  the  case  that  if  Steve  studies,  Steve  will 
pass  the  exam. 

j.  If  Tom  does  not  pass  the  exam,  then  if  Steve  does  not  study,  Steve  won’t  pass  the 
exam. 

7.  Make  up  your  own  translation  key  in  order  to  translate  the  following  sentences  into 
English.  Write  out  the  English  equivalents  in  English  sentences  that  seem  (as  much  as  is 
possible)  natural. 
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a.  (R-S) 

b.  -i-iR 

c.  (S^R) 

d.  (iS- >"-'R) 

e.  --(R-»S) 


[3]  One  thing  is  a  little  funny  about  this  second  example  with  unknown  number  a.  We 
will  not  be  able  to  find  a  number  that  is  evenly  divisible  by  4  and  not  evenly  divisible  by 
2,  so  the  world  will  never  be  like  the  second  row  of  this  truth  table  describes.  Two  things 
need  to  be  said  about  this.  First,  this  oddity  arises  because  of  mathematical  facts,  not  facts  of 
our  propositional  logic — that  is,  we  need  to  know  what  “divisible”  means,  what  “4”  and  “2” 
mean,  and  so  on,  in  order  to  understand  the  sentence.  So,  when  we  see  that  the  second  row 
is  not  possible,  we  are  basing  that  on  our  knowledge  of  mathematics,  not  on  our  knowledge 
of  propositional  logic.  Second,  some  conditionals  can  be  false.  In  defining  the  conditional, 
we  need  to  consider  all  possible  conditionals;  so,  we  must  define  the  conditional  for  any 
case  where  the  antecedent  is  true  and  the  consequent  is  false,  even  if  that  cannot  happen  for 
this  specific  example. 


3.  Good  Arguments 

3.1  A  historical  example 

An  important  example  of  excellent  reasoning  can  be  found  in  the  case  of  the  medical 
advances  of  the  Nineteenth  Century  physician,  Ignaz  Semmelweis.  Semmelweis  was  an 
obstetrician  at  the  Vienna  General  Hospital.  Built  on  the  foundation  of  a  poor  house, 
and  opened  in  1784,  the  General  Hospital  is  still  operating  today.  Semmelweis,  during 
his  tenure  as  assistant  to  the  head  of  one  of  two  maternity  clinics,  noticed  something  very 
disturbing.  The  hospital  had  two  clinics,  separated  only  by  a  shared  anteroom,  known  as 
the  First  and  the  Second  Clinics.  The  mortality  rate  for  mothers  delivering  babies  in  the 
First  Clinic,  however,  was  nearly  three  times  as  bad  as  the  mortality  for  mothers  in  the 
Second  Clinic  (9.9  %  average  versus  3.4%  average).  The  same  was  true  for  the  babies  bom 
in  the  clinics:  the  mortality  rate  in  the  First  Clinic  was  6.1%  versus  2.1%  at  the  Second 
Clinic. [5]  In  nearly  all  these  cases,  the  deaths  were  caused  by  what  appeared  to  be  the  same 
illness,  commonly  called  “childbed  fever”.  Worse,  these  numbers  actually  understated  the 
mortality  rate  of  the  First  Clinic,  because  sometimes  very  ill  patients  were  transferred  to  the 
general  treatment  portion  of  the  hospital,  and  when  they  died,  their  death  was  counted  as 
part  of  the  mortality  rate  of  the  general  hospital,  not  of  the  First  Clinic. 

Semmelweis  set  about  trying  to  determine  why  the  First  Clinic  had  the  higher  mortality 
rate.  He  considered  a  number  of  hypotheses,  many  of  which  were  suggested  by  or  believed 
by  other  doctors. 

One  hypothesis  was  that  cosmic-atmospheric-terrestrial  influences  caused  childbed  fever. 
The  idea  here  was  that  some  kind  of  feature  of  the  atmosphere  would  cause  the  disease. 
But,  Semmelweis  observed,  the  First  and  Second  Clinics  were  very  close  to  each  other, 
had  similar  ventilation,  and  shared  a  common  anteroom.  So,  they  had  similar  atmospheric 
conditions.  He  reasoned:  If  childbed  fever  is  caused  by  cosmic-atmospheric-terrestrial 
influences,  then  the  mortality  rate  would  be  similar  in  the  First  and  Second  Clinics.  But  the 
mortality  rate  was  not  similar  in  the  First  and  Second  Clinics.  So,  the  childbed  fever  was 
not  caused  by  cosmic-atmospheric-terrestrial  influences. 

Another  hypothesis  was  that  overcrowding  caused  the  childbed  fever.  But,  if  overcrowding 
caused  the  childbed  fever,  then  the  more  crowded  of  the  two  clinics  should  have  the  higher 
mortality  rate.  But,  the  Second  Clinic  was  more  crowded  (in  part  because,  aware  of  its 
lower  mortality  rate,  mothers  fought  desperately  to  be  put  there  instead  of  in  the  First 
Clinic).  It  did  not  have  a  higher  mortality  rate.  So,  the  childbed  fever  was  not  caused  by 
overcrowding. 

Another  hypothesis  was  that  fear  caused  the  childbed  fever.  In  the  Second  Clinic,  the  priest 
delivering  last  rites  could  walk  directly  to  a  dying  patient’s  room.  For  reasons  of  the  layout 
of  the  rooms,  the  priest  delivering  last  rites  in  the  First  Clinic  walked  by  all  the  rooms, 
ringing  a  bell  announcing  his  approach.  This  frightened  patients;  they  could  not  tell  if  the 
priest  was  coming  for  them.  Semmelweis  arranged  a  different  route  for  the  priest  and  asked 
him  to  silence  his  bell.  He  reasoned:  if  the  higher  rate  of  childbed  fever  was  caused  by  fear 
of  death  resulting  from  the  priest’s  approach,  then  the  rate  of  childbed  fever  should  decline 
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if  people  could  not  tell  when  the  priest  was  coming  to  the  Clinic.  But  it  was  not  the  case 
that  the  rate  of  childbed  fever  declined  when  people  could  not  tell  if  the  priest  was  coming 
to  the  First  Clinic.  So,  the  higher  rate  of  childbed  fever  in  the  First  Clinic  was  not  caused 
by  fear  of  death  resulting  from  the  priest’s  approach. 

In  the  First  Clinic,  male  doctors  were  trained;  this  was  not  true  in  the  Second  Clinic. 
These  male  doctors  performed  autopsies  across  the  hall  from  the  clinic,  before  delivering 
babies.  Semmelweis  knew  of  a  doctor  who  cut  himself  while  performing  an  autopsy,  and 
who  then  died  a  terrible  death  not  unlike  that  of  the  mothers  who  died  of  childbed  fever. 
Semmelweis  formed  a  hypothesis.  The  childbed  fever  was  caused  by  something  on  the 
hands  of  the  doctors,  something  that  they  picked  up  from  corpses  during  autopsies,  but  that 
infected  the  women  and  infants.  He  reasoned  that:  if  the  fever  was  caused  by  cadaveric 
matter  on  the  hands  of  the  doctors,  then  the  mortality  rate  would  drop  when  doctors 
washed  their  hands  with  chlorinated  water  before  delivering  babies.  He  forced  the  doctors 
to  do  this.  The  result  was  that  the  mortality  rate  dropped  to  a  rate  below  that  even  of  the 
Second  Clinic. 

Semmelweis  concluded  that  the  best  explanation  of  the  higher  mortality  rate  was  this 
“cadaveric  matter”  on  the  hands  of  doctors.  He  was  the  first  person  to  see  that  washing  of 
hands  with  sterilizing  cleaners  would  save  thousands  of  lives.  It  is  hard  to  overstate  how 
important  this  contribution  is  to  human  well  being.  Semmelweis’s  fine  reasoning  deserves 
our  endless  respect  and  gratitude. 

But  how  can  we  be  sure  his  reasoning  was  good?  Semmelweis  was  essentially  considering 
a  series  of  arguments.  Let  us  turn  to  the  question:  how  shall  we  evaluate  arguments? 

3.2  Arguments 

Our  logical  language  now  allows  us  to  say  conditional  and  negation  statements.  That  may 
not  seem  like  much,  but  our  language  is  now  complex  enough  for  us  to  develop  the  idea  of 
using  our  logic  not  just  to  describe  things,  but  also  to  reason  about  those  things. 

We  will  think  of  reasoning  as  providing  an  argument.  Here,  we  use  the  word  “argument” 
not  in  the  sense  of  two  or  more  people  criticizing  each  other,  but  rather  in  the  sense  we 
mean  when  we  say,  “Pythagoras’s  argument”.  In  such  a  case,  someone  is  using  language  to 
try  to  convince  us  that  something  is  true.  Our  goal  is  to  make  this  notion  very  precise,  and 
then  identify  what  makes  an  argument  good. 

We  need  to  begin  by  making  the  notion  of  an  argument  precise.  Our  logical  language  so 
far  contains  only  sentences.  An  argument  will,  therefore,  consist  of  sentences.  In  a  natural 
language,  we  use  the  term  “argument”  in  a  strong  way,  which  includes  the  suggestion 
that  the  argument  should  be  good.  However,  we  want  to  separate  the  notion  of  a  good 
argument  from  the  notion  of  an  argument,  so  we  can  identify  what  makes  an  argument 
good,  and  what  makes  an  argument  bad.  To  do  this,  we  will  start  with  a  minimal  notion 
of  what  an  argument  is.  Here  is  the  simplest,  most  minimal  notion: 

Argument:  an  ordered  list  of  sentences;  we  call  one  of  these  sentences  the 
“conclusion”,  and  we  call  the  other  sentences  “premises”. 

This  is  obviously  very  weak.  (There  is  a  famous  Monty  Python  skit  where  one  of  the 
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comedians  ridicules  the  very  idea  that  such  a  thing  could  be  called  an  argument.)  But  for 
our  purposes,  this  is  a  useful  notion  because  it  is  very  clearly  defined,  and  we  can  now  ask, 
what  makes  an  argument  good? 

The  everyday  notion  of  an  argument  is  that  it  is  used  to  convince  us  to  believe  something. 
The  thing  that  we  are  being  encouraged  to  believe  is  the  conclusion.  Following  our 
definition  of  “argument”,  the  reasons  that  the  person  gives  will  be  what  we  are  calling 
“premises”.  But  belief  is  a  psychological  notion.  We  instead  are  interested  only  in  truth. 
So,  we  can  reformulate  this  intuitive  notion  of  what  an  argument  should  do,  and  think  of 
an  argument  as  being  used  to  show  that  something  is  true.  The  premises  of  the  argument 
are  meant  to  show  us  that  the  conclusion  is  true. 

What  then  should  be  this  relation  between  the  premises  and  the  conclusion?  Intuitive 
notions  include  that  the  premises  should  support  the  conclusion,  or  corroborate  the 
conclusion,  or  make  the  conclusion  true.  But  “support”  and  “corroborate”  sound  rather 
weak,  and  “make”  is  not  very  clear.  What  we  can  use  in  their  place  is  a  stronger  standard: 
let  us  say  as  a  first  approximation  that  if  the  premises  are  true,  the  conclusion  is  true. 

But  even  this  seems  weak,  on  reflection.  For,  the  conclusion  could  be  true  by  accident, 
for  reasons  unrelated  to  our  premises.  Remember  that  we  define  the  conditional  as  true  if 
the  antecedent  and  consequent  are  true.  But  this  could  happen  by  accident.  For  example, 
suppose  I  say,  “If  Tom  wears  blue  then  he  will  get  an  A  on  the  exam”.  Suppose  also  that 
Tom  both  wears  blue  and  Tom  gets  an  A  on  the  exam.  This  makes  the  conditional  true, 
but  (we  hope)  the  color  of  his  clothes  really  had  nothing  to  do  with  his  performance  on  the 
exam.  Just  so,  we  want  our  definition  of  “good  argument”  to  be  such  that  it  cannot  be  an 
accident  that  the  premises  and  conclusion  are  both  true. 

A  better  and  stronger  standard  would  be  that,  necessarily,  given  true  premises,  the 
conclusion  is  true. 

This  points  us  to  our  definition  of  a  good  argument.  It  is  traditional  to  call  a  good  argument 
“valid.” 


Valid  argument:  an  argument  for  which,  necessarily,  if  the  premises  are  true,  then 
the  conclusion  is  true. 

This  is  the  single  most  important  principle  in  this  book.  Memorize  it. 

A  bad  argument  is  an  argument  that  is  not  valid.  Our  name  for  this  will  be  an  “invalid 
argument”. 

Sometimes,  a  dictionary  or  other  book  will  define  or  describe  a  “valid  argument”  as  an 
argument  that  follows  the  rules  of  logic.  This  is  a  hopeless  way  to  define  “valid”,  because 
it  is  circular  in  a  pernicious  way:  we  are  going  to  create  the  rules  of  our  logic  in  order 
to  ensure  that  they  construct  valid  arguments.  We  cannot  make  rules  of  logical  reasoning 
until  we  know  what  we  want  those  rules  to  do,  and  what  we  want  them  to  do  is  to  create 
valid  arguments.  So  “valid”  must  be  defined  before  we  can  make  our  reasoning  system. 

Experience  shows  that  if  a  student  is  to  err  in  understanding  this  definition  of  “valid 
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argument”,  he  or  she  will  typically  make  the  error  of  assuming  that  a  valid  argument  has 
all  true  premises.  This  is  not  required.  There  are  valid  arguments  with  false  premises  and  a 
false  conclusion.  Here’s  one: 

If  Miami  is  the  capital  of  Kansas,  then  Miami  is  in  Canada.  Miami  is  the  capital 
of  Kansas.  Therefore,  Miami  is  in  Canada. 

This  argument  has  at  least  one  false  premise:  Miami  is  not  the  capital  of  Kansas.  And  the 
conclusion  is  false:  Miami  is  not  in  Canada.  But  the  argument  is  valid:  if  the  premises 
were  both  true,  the  conclusion  would  have  to  be  true.  (If  that  bothers  you,  hold  on  a  while 
and  we  will  convince  you  that  this  argument  is  valid  because  of  its  form  alone.  Also,  keep 
in  mind  always  that  “if. . .then. . .”  is  interpreted  as  meaning  the  conditional.) 

Similarly,  there  are  invalid  arguments  with  true  premises,  and  with  a  true  conclusion. 
Here’s  one: 

If  Miami  is  the  capital  of  Ontario,  then  Miami  is  in  Canada.  Miami  is  not  the 
capital  of  Ontario.  Therefore,  Miami  is  not  in  Canada. 

(If  you  find  it  confusing  that  this  argument  is  invalid,  look  at  it  again  after  you  finish 
reading  this  chapter.) 

Validity  is  about  the  relationship  between  the  sentences  in  the  argument.  It  is  not  a  claim 
that  those  sentences  are  true. 

Another  variation  of  this  confusion  seems  to  arise  when  we  forgot  to  think  carefully  about 
the  conditional.  The  definition  of  valid  is  not  “All  the  premises  are  true,  so  the  conclusion  is 
true.”  If  you  don’t  see  the  difference,  consider  the  following  two  sentences.  “If  your  house 
is  on  fire,  then  you  should  call  the  fire  department.”  In  this  sentence,  there  is  no  claim  that 
your  house  is  on  fire.  It  is  rather  advice  about  what  you  should  do  if  your  house  is  on 
fire.  In  the  same  way,  the  definition  of  valid  argument  does  not  tell  you  that  the  premises 
are  true.  It  tells  you  what  follows  if  they  are  true.  Contrast  now,  “Your  house  is  on  fire, 
so  you  should  call  the  fire  department”.  This  sentence  delivers  very  bad  news.  It  is  not  a 
conditional  at  all.  What  it  really  means  is,  “Your  house  is  on  fire  and  you  should  call  the  fire 
department”.  Our  definition  of  valid  is  not,  “All  the  premises  are  true  and  the  conclusion  is 
true”. 

Finally,  another  common  mistake  is  to  confuse  true  and  valid.  In  the  sense  that  we  are  using 
these  terms  in  this  book,  only  sentences  can  be  true  or  false,  and  only  arguments  can  be 
valid  and  invalid.  When  discussing  and  using  our  logical  language,  it  is  nonsense  to  say,  “a 
true  argument”,  and  it  is  nonsense  to  say,  “a  valid  sentence”. 

Someone  new  to  logic  might  wonder,  why  would  we  want  a  definition  of  “good 
argument”  that  does  not  guarantee  that  our  conclusion  is  true?  The  answer  is  that  logic  is 
an  enormously  powerful  tool  for  checking  arguments,  and  we  want  to  be  able  to  identify 
what  the  good  arguments  are,  independently  of  the  particular  premises  that  we  use  in  the 
argument.  For  example,  there  are  infinitely  many  particular  arguments  that  have  the  same 
form  as  the  valid  argument  given  above.  There  are  infinitely  many  particular  arguments 
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that  have  the  same  form  as  the  invalid  argument  given  above.  Logic  lets  us  embrace  all  the 
former  arguments  at  once,  and  reject  all  those  bad  ones  at  once. 

Furthermore,  our  propositional  logic  will  not  be  able  to  tell  us  whether  an  atomic  sentence 
is  true.  If  our  argument  is  about  rocks,  we  must  ask  the  geologist  if  the  premises  are  true. 
If  our  argument  is  about  history,  we  must  ask  the  historian  if  the  premises  are  true.  If  our 
argument  is  about  music,  we  must  ask  the  music  theorist  if  the  premises  are  true.  But  the 
logician  can  tell  the  geologist,  the  historian,  and  the  musicologist  whether  her  arguments 
are  good  or  bad,  independent  of  the  particular  premises. 

We  do  have  a  common  term  for  a  good  argument  that  has  true  premises.  This  is  called 
“sound”.  It  is  a  useful  notion  when  we  are  applying  our  logic.  Here  is  our  definition: 

Sound  argument:  a  valid  argument  with  true  premises. 

A  sound  argument  must  have  a  true  conclusion,  given  the  definition  of  “valid”. 

3.3  Checking  arguments  semantically 

Every  element  of  our  definition  of  “valid”  is  clear  except  for  one.  We  know  what 
“if.  ..then...”  means.  We  defined  the  semantics  of  the  conditional  in  chapter  2.  We  have 
defined  “argument”,  “premise”,  and  “conclusion”.  We  take  true  and  false  as  primitives.  But 
what  does  “necessarily”  mean? 

We  define  a  valid  argument  as  one  where,  necessarily,  if  the  premises  are  true,  then  the 
conclusion  is  true.  It  would  seem  the  best  way  to  understand  this  is  to  say,  there  is  no 
situation  in  which  the  premises  are  true  but  the  conclusion  is  false.  But  then,  what  are  these 
“situations”?  Fortunately,  we  already  have  a  tool  that  looks  like  it  could  help  us:  the  truth 
table. 

Remember  that  in  the  truth  table,  we  put  on  the  bottom  left  side  all  the  possible 
combinations  of  truth  values  of  some  set  of  atomic  sentences.  Each  row  of  the  table 
then  represents  a  kind  of  way  the  world  could  be.  Using  this  as  a  way  to  understand 
“necessarily”,  we  could  rephrase  our  definition  of  valid  to  something  like  this,  “In  any  kind 
of  situation  in  which  all  the  premises  are  true,  the  conclusion  is  true.” 

Let’s  try  it  out.  We  will  need  to  use  truth  tables  in  a  new  way:  to  check  an  argument. 
That  will  require  having  not  just  one  sentence,  but  several  on  the  truth  table.  Consider  an 
argument  that  looks  like  it  should  be  valid. 

If  Jupiter  is  more  massive  than  Earth,  then  Jupiter  has  a  stronger  gravitational 
field  than  Earth.  Jupiter  is  more  massive  than  Earth.  In  conclusion,  Jupiter  has  a 
stronger  gravitational  field  than  Earth. 

This  looks  like  it  has  the  form  of  a  valid  argument,  and  it  looks  like  an  astrophysicist  would 
tell  us  it  is  sound.  Let’s  translate  it  to  our  logical  language  using  the  following  translation 
key.  (We’ve  used  up  our  letters,  so  I’m  going  to  start  over.  We’ll  do  that  often:  assume  we 
are  starting  a  new  language  each  time  we  translate  a  new  set  of  problems  or  each  time  we 
consider  a  new  example.) 
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P:  Jupiter  is  more  massive  than  Earth 

Q:  Jupiter  has  a  stronger  gravitational  field  than  Earth. 

This  way  of  writing  out  sentences  of  logic  and  sentences  of  English  we  can  call  a 
“translation  key”.  We  can  use  this  format  whenever  we  want  to  explain  what  our  sentences 
mean  in  English. 

Using  this  key,  our  argument  would  be  formulated 

(P-Q) 

P 


Q 

That  short  line  is  not  part  of  our  language,  but  rather  is  a  handy  tradition.  When  quickly 
writing  down  arguments,  we  write  the  premises,  and  then  write  the  conclusion  last,  and 
draw  a  short  line  above  the  conclusion. 

This  is  an  argument:  it  is  an  ordered  list  of  sentences,  the  first  two  of  which  are  premises 
and  the  last  of  which  is  the  conclusion. 

To  make  a  truth  table,  we  identify  all  the  atomic  sentences  that  constitute  these  sentences. 
These  are  P  and  Q.  There  are  four  possible  kinds  of  ways  the  world  could  be  that  matter 
to  us  then: 


Well  write  out  the  sentences,  in  the  order  of  premises  and  then  conclusion. 


premise 

premise 

conclusion 

p 

Q 

(P-Q) 

P 

Q 

T 

T 

T 

F 

F 

T 

F 

F 
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Now  we  can  fill  in  the  columns  for  each  sentence,  identifying  the  truth  value  of  the 
sentence  for  that  kind  of  situation. 


premise 

premise 

conclusion 

p 

Q 

(P-Q) 

P 

Q 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

F 

F 

We  know  how  to  fill  in  the  column  for  the  conditional  because  we  can  refer  back  to  the 
truth  table  used  to  define  the  conditional,  to  determine  what  its  truth  value  is  when  the  first 
part  and  second  part  are  true;  and  so  on.  P  is  true  in  those  kinds  of  situations  where  P  is 
true,  and  P  is  false  in  those  kinds  of  situations  where  P  is  false.  And  the  same  is  so  for  Q. 

Now,  consider  all  those  kinds  of  ways  the  world  could  be  such  that  all  the  premises  are 
true.  Only  the  first  row  of  the  truth  table  is  one  where  all  the  premises  are  true.  Note 
that  the  conclusion  is  true  in  that  row.  That  means,  in  any  kind  of  situation  in  which  all 
the  premises  are  true,  the  conclusion  will  be  true.  Or,  equivalently:  necessarily,  if  all  the 
premises  are  true,  then  the  conclusion  is  true. 


premise 

premise 

conclusion 

P 

Q 

(P-Q) 

P 

Q 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

F 

F 

Consider  in  contrast  the  second  argument  above,  the  invalid  argument  with  all  true 
premises  and  a  true  conclusion.  Well  use  the  following  translation  key. 

R:  Miami  is  the  capital  of  Ontario 

S:  Miami  is  in  Canada 

And  our  argument  is  thus 

(R-S) 

iR 


■S 
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Here  is  the  truth  table. 


premise 

premise 

conclusion 

R 

S 

(R-S) 

-iR 

-iS 

T 

T 

T 

F 

F 

T 

F 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

T 

T 

Note  that  there  are  two  kinds  of  ways  that  the  world  could  be  in  which  all  of  our  premises 
are  true.  These  correspond  to  the  third  and  fourth  row  of  the  truth  table.  But  for  the 
third  row  of  the  truth  table,  the  premises  are  true  but  the  conclusion  is  false.  Yes,  there 
is  a  kind  of  way  the  world  could  be  in  which  all  the  premises  are  true  and  the  conclusion 
is  true;  that  is  shown  in  the  fourth  row  of  the  truth  table.  But  we  are  not  interested  in 
identifying  arguments  that  will  have  true  conclusions  if  we  are  lucky.  We  are  interested  in 
valid  arguments.  This  argument  is  invalid.  There  is  a  kind  of  way  the  world  could  be  such 
that  all  the  premises  are  true  and  the  conclusion  is  false.  We  can  highlight  this. 


premise 

premise 

conclusion 

R 

S 

(R-S) 

-.R 

-iS 

T 

T 

T 

F 

F 

T 

F 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

T 

T 

Hopefully  it  becomes  clear  why  we  care  about  validity.  Any  argument  of  the  form, 
(P-»Q)  and  P,  therefore  Q,  is  valid.  We  do  not  have  to  know  what  P  and  Q  mean  to 
determine  this.  Similarly,  any  argument  of  the  form,  (R^S)  and  -iR,  therefore  -iS,  is  invalid. 
We  do  not  have  to  know  what  R  and  S  mean  to  determine  this.  So  logic  can  be  of  equal 
use  to  the  astronomer  and  the  financier,  the  computer  scientist  or  the  sociologist. 

3.4  Returning  to  our  historical  example 

We  described  some  (not  all)  of  the  hypotheses  that  Semmelweis  tested  when  he  tried  to 
identify  the  cause  of  childbed  fever,  so  that  he  could  save  thousands  of  women  and  infants. 
Let  us  symbolize  these  and  consider  his  reasoning. 

The  first  case  we  considered  was  one  where  he  reasoned:  If  childbed  fever  is  caused  by 
cosmic-atmospheric-terrestrial  influences,  then  the  mortality  rate  would  be  similar  in  the 
First  and  Second  Clinics.  But  the  mortality  rate  was  not  similar  in  the  First  and  Second 
Clinics.  So,  the  childbed  fever  is  not  caused  by  cosmic-atmospheric-terrestrial  influences. 


Here  is  a  key  to  symbolize  the  argument. 
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T:  Childbed  fever  is  caused  by  cosmic-atmospheric-terrestrial  influences. 
U:  The  mortality  rate  is  similar  in  the  First  and  Second  Clinics. 

This  would  mean  the  argument  is: 

(T-U) 

-.U 


iT 

Is  this  argument  valid?  We  can  check  using  a  truth  table. 


premise 

premise 

conclusion 

T 

U 

(T-U) 

-.u 

-iT 

T 

T 

T 

F 

F 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

T 

T 

The  last  row  is  the  only  row  where  all  the  premises  are  true.  For  this  row,  the  conclusion  is 
true.  Thus,  for  all  the  kinds  of  ways  the  world  could  be  in  which  the  premises  are  true,  the 
conclusion  is  also  true.  This  is  a  valid  argument.  If  we  accept  his  premises,  then  we  should 
accept  that  childbed  fever  was  not  caused  by  cosmic-atmospheric-terrestrial  influences. 

The  second  argument  we  considered  was  the  concern  that  fear  caused  the  higher  mortality 
rates,  particularly  the  fear  of  the  priest  coming  to  deliver  last  rites.  Semmelweis  reasoned 
that  if  the  higher  rate  of  childbed  fever  is  caused  by  fear  of  death  resulting  from  the  priest’s 
approach,  then  the  rate  of  childbed  fever  should  decline  if  people  cannot  discern  when  the 
priest  is  coming  to  the  Clinic.  Here  is  a  key: 

V:  the  higher  rate  of  childbed  fever  is  caused  by  fear  of  death  resulting  from  the 
priest’s  approach. 

W:  the  rate  of  childbed  fever  will  decline  if  people  cannot  discern  when  the  priest 
is  coming  to  the  Clinic. 

But  when  Semmelweis  had  the  priest  silence  his  bell,  and  take  a  different  route,  so  that 
patients  could  not  discern  that  he  was  coming  to  the  First  Clinic,  he  found  no  difference  in 
the  mortality  rate;  the  First  Clinic  remained  far  worse  than  the  second  clinic.  He  concluded 
that  the  higher  rate  of  childbed  fever  was  not  caused  by  fear  of  death  resulting  from  the 
priest’s  approach. 
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(V-W) 

MV 


-V 

Is  this  argument  valid?  We  can  check  using  a  truth  table. 


premise 

premise 

conclusion 

V 

W 

(V-W) 

MV 

-.V 

T 

T 

T 

F 

F 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

T 

T 

Again,  we  see  that  Semmelweis’s  reasoning  was  good.  He  showed  that  it  was  not  the  case 
that  the  higher  rate  of  childbed  fever  was  caused  by  fear  of  death  resulting  from  the  Priest’s 
approach. 

What  about  Semmelweis’s  positive  conclusion,  that  the  higher  mortality  rate  was  caused 
by  some  contaminant  from  the  corpses  that  doctors  had  autopsied  just  before  they  assisted 
in  a  delivery?  To  understand  this  step  in  his  method,  we  need  to  reflect  a  moment  on  the 
scientific  method  and  its  relation  to  logic. 

3.5  Other  kinds  of  arguments  1:  Scientific  reasoning 

Valid  arguments,  and  the  methods  that  we  are  developing,  are  sometimes  called  “deductive 
reasoning”.  This  is  the  kind  of  reasoning  in  which  necessarily  our  conclusions  is  true  if  our 
premises  are  true;  these  arguments  can  be  shown  to  be  good  by  way  of  our  logical  reasoning 
alone.  There  are  other  kinds  of  reasoning,  and  understanding  this  may  help  clarify  the 
relation  of  logic  to  other  endeavors.  Two  important,  and  closely  related,  alternatives  to 
deductive  reasoning  are  scientific  reasoning  and  statistical  generalizations.  We’ll  discuss 
statistical  generalizations  in  the  next  section. 

Scientific  method  relies  upon  logic,  but  science  is  not  reducible  to  logic:  scientists  do 
empirical  research.  That  is,  they  examine  and  test  phenomena  in  the  world.  This  is  a 
very  important  difference  from  pure  logic.  To  understand  how  this  difference  results  in  a 
distinct  method,  let  us  review  Semmelweis’s  important  discovery. 

The  details  and  nature  of  scientific  reasoning  are  somewhat  controversial.  I  am  going  to 
provide  here  a  basic — many  philosophers  would  say,  oversimplified — account  of  scientific 
reasoning.  My  goal  is  to  indicate  the  relation  between  logic  and  the  kind  of  reasoning 
Semmelweis  may  have  used. 

As  we  noted,  Semmelweis  learned  about  the  death  of  a  colleague,  Professor  Jakob 
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Kolletschka.  Kolletschka  had  been  performing  an  autopsy,  and  he  cut  his  finger.  Shortly 
thereafter,  Kolletschka  died  with  symptoms  like  those  of  childbed  fever.  Semmelweis 
reasoned  that  something  on  the  corpse  caused  the  disease;  he  called  this  “cadaveric  matter”. 
In  the  First  Clinic,  where  the  mortality  rate  of  women  and  babies  was  high,  doctors  were 
doing  autopsies  and  then  delivering  babies  immediately  after.  If  he  could  get  this  cadaveric 
matter  oft  the  hands  of  the  doctors,  the  rate  of  childbed  fever  should  fall. 

So,  he  reasoned  thus:  if  the  fever  is  caused  by  cadaveric  matter  on  the  hands  of  the  doctors, 
then  the  mortality  rate  will  drop  when  doctors  wash  their  hands  with  chlorinated  water 
before  delivering  babies.  He  forced  the  doctors  to  do  this.  The  result  was  that  the  mortality 
rate  dropped  a  very  great  deal,  at  times  to  below  1%. 

Here  is  a  key: 

P:  The  fever  is  caused  by  cadaveric  matter  on  the  hands  of  the  doctors. 

Q:  The  mortality  rate  will  drop  when  doctors  wash  their  hands  with  chlorinated 
water  before  delivering  babies. 

And  the  argument  appears  to  be  something  like  this  (as  we  will  see,  this  isn’t  quite  the  right 
way  to  put  it,  but  for  now. . .): 

(P-Q) 

Q 


P 


Is  this  argument  valid?  We  can  check  using  a  truth  table. 


premise 

premise 

conclusion 

p 

Q 

(P-Q) 

Q 

P 

T 

T 

T 

T 

T 

T 

F 

F 

F 

T 

F 

T 

T 

T 

F 

F 

F 

T 

F 

F 

From  this,  it  looks  like  Semmelweis  has  used  an  invalid  argument! 

However,  an  important  feature  of  scientific  reasoning  must  be  kept  in  mind.  There  is 
some  controversy  over  the  details  of  the  scientific  method,  but  the  most  basic  view  goes 
something  like  this.  Scientists  formulate  hypotheses  about  the  possible  causes  or  features  of 
a  phenomenon.  They  make  predictions  based  on  these  hypotheses,  and  then  they  perform 
experiments  to  test  those  predictions.  The  reasoning  here  uses  the  conditional:  if  the 
hypotheses  is  true,  then  the  particular  prediction  will  be  true.  If  the  experiment  shows 
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that  the  prediction  is  false,  then  the  scientist  rejects  the  hypothesis.1-6-1  But  if  the  prediction 
proved  to  be  true,  then  the  scientist  has  shown  that  the  hypothesis  may  be  true — at  least, 
given  the  information  we  glean  from  the  conditional  and  the  consequent  alone. 

This  is  very  important.  Scientific  conclusions  are  about  the  physical  world,  they  are  not 
about  logic.  This  means  that  scientific  claims  are  not  necessarily  true,  in  the  sense  of 
“necessarily”  that  we  used  in  our  definition  of  “valid”.  Instead,  science  identifies  claims  that 
may  be  true,  or  (after  some  progress)  are  very  likely  to  be  true,  or  (after  very  much  progress) 
are  true. 

Scientists  keep  testing  their  hypotheses,  using  different  predictions  and  experiments.  Very 
often,  they  have  several  competing  hypotheses  that  have,  so  far,  survived  testing.  To  decide 
between  these,  they  can  use  a  range  of  criteria.  In  order  of  their  importance,  these  include: 
choose  the  hypothesis  with  the  most  predictive  power  (the  one  that  correctly  predicts  more 
kinds  of  phenomena);  choose  the  hypothesis  that  will  be  most  productive  of  other  scientific 
theories;  choose  the  hypothesis  consistent  with  your  other  accepted  hypotheses;  choose  the 
simplest  hypothesis. 

What  Semmelweis  showed  was  that  it  could  be  true  that  cadaveric  matter  caused  the 
childbed  fever.  This  hypothesis  predicted  more  than  any  other  hypothesis  that  the  doctors 
had,  and  so  for  that  reason  alone  this  was  the  very  best  hypothesis.  “But,”  you  might  reason, 
“doesn’t  that  mean  his  conclusion  was  true?  And  don’t  we  know  now,  given  all  that  we’ve 
learned,  that  his  conclusion  must  be  true?”  No.  He  was  far  ahead  of  other  doctors,  and 
his  deep  insights  were  of  great  service  to  all  of  humankind.  But  the  scientific  method 
continued  to  refine  Semmelweis’s  ideas.  For  example,  later  doctors  introduced  the  idea  of 
microorganisms  as  the  cause  of  childbed  fever,  and  this  refined  and  improved  Semmelweis’s 
insights:  it  was  not  because  the  cadaveric  matter  came  from  corpses  that  it  caused  the 
disease;  it  was  because  the  cadaveric  matter  contained  particular  micro-organisms  that  it 
caused  the  disease.  So,  further  scientific  progress  showed  his  hypothesis  could  be  revised 
and  improved. 

To  review  and  summarize,  with  the  scientific  method: 

1.  We  develop  a  hypothesis  about  the  causes  or  nature  of  a  phenomenon. 

2.  We  predict  what  (hopefully  unexpected)  effects  are  a  consequence  of  this  hypothesis. 

3.  We  check  with  experiments  to  see  if  these  predictions  come  true: 

•  If  the  predictions  prove  false,  we  reject  the  hypothesis;17-1 

•  If  the  predictions  prove  true,  we  conclude  that  the  hypothesis  could  be  true. 

We  continue  to  test  the  hypothesis  by  making  other  predictions  (that  is,  we 
return  to  step  2). 

This  means  that  a  hypothesis  that  does  not  make  testable  predictions  (that  is,  a  hypothesis 
that  cannot  possibly  be  proven  false)  is  not  a  scientific  hypothesis.  Such  a  hypothesis  is 
called  “unfalsifiable”  and  we  reject  it  as  unscientific. 

This  method  can  result  in  more  than  one  hypothesis  being  shown  to  be  possibly  true. 
Then,  we  chose  between  competing  hypotheses  by  using  criteria  like  the  following  (here 
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ordered  by  their  relative  importance;  “theory”  can  be  taken  to  mean  a  collection  of  one  or 
more  hypotheses): 

1.  Predictive  power:  the  more  that  a  hypothesis  can  successfully  predict,  the  better  it  is. 

2.  Productivity:  a 
preferred. 

3.  Coherence  with  Existing  Theory:  if  two  hypotheses  predict  the  same  amount  and  are 
equally  productive,  then  the  hypothesis  that  coheres  with  (does  not  contradict)  other 
successful  theories  is  preferable  to  one  that  does  contradict  them. 

4.  Simplicity:  if  two  hypotheses  are  equally  predictive,  productive,  and  coherent  with 
existing  theories,  then  the  simpler  hypothesis  is  preferable. 

Out  of  respect  to  Ignaz  Semmelweis  we  should  tell  the  rest  of  his  story,  although  it  means 
we  must  end  on  a  sad  note.  Semmelweis’s  great  accomplishment  was  not  respected  by 
his  colleagues,  who  resented  being  told  that  their  lack  of  hygiene  was  causing  deaths.  He 
lost  his  position  at  the  First  Clinic,  and  his  successors  stopped  the  program  of  washing 
hands  in  chlorinated  water.  The  mortality  rate  leapt  back  to  its  catastrophically  high  levels. 
Countless  women  and  children  died.  Semmelweis  continued  to  promote  his  ideas,  and 
this  caused  growing  resentment.  Eventually,  several  doctors  in  Vienna — not  one  of  them 
a  psychiatrist — secretly  signed  papers  declaring  Semmelweis  insane.  We  do  not  know 
whether  Semmelweis  was  mentally  ill  at  this  time.  These  doctors  took  him  to  an  asylum 
on  the  pretense  of  having  him  visit  in  his  capacity  as  a  doctor;  when  he  arrived,  the  guards 
seized  Semmelweis.  He  struggled,  and  the  guards  at  the  asylum  beat  him  severely,  put  him 
in  a  straightjacket,  and  left  him  alone  in  a  locked  room.  Neglected  in  isolation,  the  wounds 
from  his  beating  became  infected  and  he  died  a  week  later. 

It  was  years  before  Semmelweis’s  views  became  widely  accepted  and  his  accomplishment 
properly  recognized.  His  life  teaches  many  lessons,  including  unfortunately  that  even 
the  most  educated  among  us  can  be  evil,  petty,  and  willfully  ignorant.  Let  us  repay 
Semmelweis,  as  those  in  his  own  time  did  not,  by  remembering  and  praising  his  scientific 
acumen  and  humanity. 

3.6  Other  kinds  of  arguments  2:  Probability 

Here  we  can  say  a  few  words  about  statistical  generalizations — our  goal  being  only  to 
provide  a  contrast  with  deductive  reasoning. 

In  one  kind  of  statistical  generalization,  we  have  a  population  of  some  kind  that  we  want 
to  make  general  claims  about.  A  population  could  be  objects  or  events.  So,  a  population 
can  be  a  group  of  organisms,  or  a  group  of  weather  events.  “Population”  just  means  all 
the  events  or  all  the  things  we  want  to  make  a  generalization  about.  Often  however  it  is 
impossible  to  examine  every  object  or  event  in  the  population,  so  what  we  do  is  gather 
a  sample.  A  sample  is  some  portion  of  the  population.  Our  hope  is  that  the  sample  is 
representative  of  the  population:  that  whatever  traits  are  shared  by  the  members  of  the 
sample  are  also  shared  by  the  members  of  the  population. 

For  a  sample  to  representative,  it  must  be  random  and  large  enough.  “Random”  in  this 
context  means  that  the  sample  was  not  chosen  in  any  way  that  might  distinguish  members 
of  the  sample  from  the  population,  other  than  being  members  of  the  population.  In  other 


hypothesis  that  suggests  more  new  directions  for  research  is  to  be 
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words,  every  member  of  the  population  was  equally  likely  to  be  in  the  sample.  “Large 
enough’’  is  harder  to  define.  Statisticians  have  formal  models  describing  this,  but  suffice  to 
say  we  should  not  generalize  about  a  whole  population  using  just  a  few  members. 

Here’s  an  example.  We  wonder  if  all  domestic  dogs  are  descended  from  wolves.  Suppose 
we  have  some  genetic  test  to  identify  if  an  organism  was  a  descendent  of  wolves.  We 
cannot  give  the  test  to  all  domestic  dogs — this  would  be  impractical  and  costly  and 
unnecessary.  We  pick  a  random  sample  of  domestic  dogs  that  is  large  enough,  and  we 
test  them.  For  the  sample  to  be  random,  we  need  to  select  it  without  allowing  any  bias 
to  influence  our  selection;  all  that  should  matter  is  that  these  are  domestic  dogs,  and  each 
member  of  the  population  must  have  an  equal  chance  of  being  in  the  sample.  Consider  the 
alternative:  if  we  just  tested  one  family  of  dogs — say,  dogs  that  are  large — we  might  end 
up  selecting  dogs  that  differed  from  others  in  a  way  that  matters  to  our  test.  For  example, 
maybe  large  dogs  are  descended  from  wolves,  but  small  dogs  are  not.  Other  kinds  of  bias 
can  creep  in  less  obviously.  We  might  just  sample  dogs  in  our  local  community,  and  it 
might  just  be  that  people  in  our  community  prefer  large  dogs,  and  again  we  would  have  a 
sample  bias.  So,  we  randomly  select  dogs,  and  give  them  the  genetic  test. 

Suppose  the  results  were  positive.  We  reason  that  if  all  the  members  of  the  randomly 
selected  and  large  enough  sample  (the  tested  dogs)  have  the  trait,  then  it  is  very  likely  that  all 
the  members  of  the  population  (all  dogs)  have  the  trait.  Thus:  we  could  say  that  it  appears 
very  likely  that  all  dogs  have  the  trait.  (This  likelihood  can  be  estimated,  so  that  we  can  also 
sometimes  say  how  likely  it  is  that  all  members  of  the  population  have  the  trait.) 

This  kind  of  reasoning  obviously  differs  from  a  deductive  argument  very  substantially.  It 
is  a  method  of  testing  claims  about  the  world,  it  requires  observations,  and  its  conclusion  is 
likely  instead  of  being  certain. 

But  such  reasoning  is  not  unrelated  to  logic.  Deductive  reasoning  is  the  foundation  of  these 
and  all  other  forms  of  reasoning.  If  one  must  reason  using  statistics  in  this  way,  one  relies 
upon  deductive  methods  always  at  some  point  in  one’s  arguments.  There  was  a  conditional 
at  the  penultimate  step  of  our  reasoning,  for  example  (we  said  “if  all  the  members  of  the 
randomly  selected  and  large  enough  sample  have  the  trait,  then  it  is  very  likely  that  all  the 
members  of  the  population  have  the  trait”).  Furthermore,  the  foundations  of  these  methods 
(the  most  fundamental  descriptions  of  what  these  methods  are)  are  given  using  logic  and 
mathematics.  Logic,  therefore,  can  be  seen  as  the  study  of  the  most  fundamental  form  of 
reasoning,  which  will  be  used  in  turn  by  all  other  forms  of  reasoning,  including  scientific 
and  statistical  reasoning. 


3.7  Problems 

1.  Make  truth  tables  to  show  that  the  following  arguments  are  valid.  Circle  or  highlight 
the  rows  of  the  truth  table  that  show  the  argument  is  valid  (that  is,  all  the  rows  where 
all  the  premises  are  true).  Note  that  you  will  need  eight  rows  in  the  truth  table  for 
problems  b-d,  and  sixteen  rows  in  the  truth  table  for  problem  e. 

a.  Premises:  (P->Q),  -.Q.  Conclusion:  -iP. 

b.  Premises:  (P->Q),  (Q-R),  --R.  Conclusion:  -iP. 

c.  Premises:  (P-Q),  (Q-R),  P.  Conclusion:  R. 
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d.  Premises:  (P-Q),  (Q-R).  Conclusion:  (P-R). 

e.  Premises:  (P->Q),  (Q-R),  (R-S).  Conclusion:  (P-S). 

2.  Make  truth  tables  to  show  the  following  arguments  are  invalid.  Circle  or  highlight  the 
rows  of  the  truth  table  that  show  the  argument  is  invalid  (that  is,  any  row  where  all  the 
premises  are  true  but  the  conclusion  is  false). 


a. 

b. 


c. 

d. 


e. 


Premises:  (P-Q).  Q  Conclusion:  P. 

Premises:  (P-Q).  Conclusion:  (Q-P). 

Premises:  (P->Q),  (Q-R),  -ffl.  Conclusion:  ->R. 
Premises:  (P->Q),  (Q-R).  Conclusion:  (R-P). 
Premises:  (P-Q),  (Q-R),  (R-S).  Conclusion:  (S-P). 


3.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 

premises.  Your  argument  should  just  be  a  paragraph  (not  an  ordered  list  of  sentences  or 
anything  else  that  looks  like  logic).  Translate  it  into  propositional  logic  and  use  a  truth 
table  to  show  it  is  valid. 


4.  In  normal  colloquial  English,  write  your  own  invalid  argument  with  at  least  two 

premises.  Translate  it  into  propositional  logic  and  use  a  truth  table  to  show  it  is  invalid. 


[5]  All  the  data  cited  here  comes  from  Carter  (1983)  and  additional  biographical 
information  comes  from  Carter  and  Carter  (2008).  These  books  are  highly  recommended 
to  anyone  interested  in  the  history  of  science  or  medicine. 

[6]  It  would  be  more  accurate  to  say,  if  the  prediction  proves  false,  the  scientist  must  reject 
either  the  hypothesis  or  some  other  premise  of  her  reasoning.  For  example,  her  argument 
may  include  the  implicit  premise  that  her  scientific  instruments  were  operating  correctly. 
She  might  instead  reject  this  premise  that  her  instruments  are  working  correctly,  change 
one  of  her  instruments,  and  try  again  to  test  the  hypothesis.  See  Duhem  (1991).  Or,  to 
return  to  the  case  of  Semmelweis,  he  might  wonder  whether  he  sufficiently  established  that 
there  were  no  differences  in  the  atmosphere  between  the  two  clinics;  or  he  might  wonder 
whether  he  sufficiently  muffled  the  Priest’s  approach;  or  whether  he  recorded  his  results 
accurately;  and  so  on.  As  noted,  my  account  of  scientific  reasoning  here  is  simplified. 

[7]  Or,  as  noted  in  note  6,  we  reject  some  other  premise  of  the  argument. 


4.  Proofs 


4.1  A  problem  with  semantic  demonstrations  of  validity 

Given  that  we  can  test  an  argument  for  validity,  it  might  seem  that  we  have  a  fully 
developed  system  to  study  arguments.  However,  there  is  a  significant  practical  difficulty 
with  our  semantic  method  of  checking  arguments  using  truth  tables  (you  may  have  already 
noted  what  this  practical  difficulty  is,  when  you  did  problems  le  and  2e  of  chapter  3). 
Consider  the  following  argument: 

Alison  will  go  to  the  party. 

If  Alison  will  go  to  the  party,  then  Beatrice  will. 

If  Beatrice  will  go  to  the  party,  then  Cathy  will. 

If  Cathy  will  go  to  the  party,  then  Diane  will. 

If  Diane  will  go  to  the  party,  then  Elizabeth  will. 

If  Elizabeth  will  go  to  the  party,  then  Fran  will. 

If  Fran  will  go  to  the  party,  then  Giada  will. 

If  Giada  will  go  to  the  party,  then  Hilary  will. 

If  Hillary  will  go  to  the  party,  then  Io  will. 

If  Io  will  go  to  the  party,  then  Julie  will. 


Julie  will  go  to  the  party. 

Most  of  us  will  agree  that  this  argument  is  valid.  It  has  a  rather  simple  form,  in  which  one 
sentence  is  related  to  the  previous  sentence,  so  that  we  can  see  the  conclusion  follows  from 
the  premises.  Without  bothering  to  make  a  translation  key,  we  can  see  the  argument  has 
the  following  form. 

P 

(P-Q) 

(Q-R) 

(R-S) 

(S-T) 
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(T-U) 

(U-V) 

(v-»w) 

(w-»x) 

(X-Y) 


Y 

However,  if  we  are  going  to  check  this  argument,  then  the  truth  table  will  require  1024 
rows!  This  follows  directly  from  our  observation  that  for  arguments  or  sentences  composed 

of  n  atomic  sentences,  the  truth  table  will  require  2n  rows.  This  argument  contains  10 

atomic  sentences.  A  truth  table  checking  its  validity  must  have  210  rows,  and  210=1024. 
Furthermore,  it  would  be  trivial  to  extend  the  argument  for  another,  say,  ten  steps,  but 
then  the  truth  table  that  we  make  would  require  more  than  a  million  rows! 

For  this  reason,  and  for  several  others  (which  become  evident  later,  when  we  consider  more 
advanced  logic),  it  is  very  valuable  to  develop  a  syntactic  proof  method.  That  is,  a  way  to 
check  proofs  not  using  a  truth  table,  but  rather  using  rules  of  syntax. 

Here  is  the  idea  that  we  will  pursue.  A  valid  argument  is  an  argument  such  that,  necessarily, 
if  the  premises  are  true,  then  the  conclusion  is  true.  We  will  start  just  with  our  premises. 
We  will  set  aside  the  conclusion,  only  to  remember  it  as  a  goal.  Then,  we  will  aim  to  find 
a  reliable  way  to  introduce  another  sentence  into  the  argument,  with  the  special  property 
that,  if  the  premises  are  true,  then  this  single  additional  sentence  to  the  argument  must  also 
be  true.  If  we  could  find  a  method  to  do  that,  and  if  after  repeated  applications  of  this 
method  we  were  able  to  write  down  our  conclusion,  then  we  would  know  that,  necessarily, 
if  our  premises  are  true  then  the  conclusion  is  true. 

The  idea  is  more  clear  when  we  demonstrate  it.  The  method  for  introducing  new 
sentences  will  be  called  “inference  rules”.  We  introduce  our  first  inference  rules  for  the 
conditional.  Remember  the  truth  table  for  the  conditional: 


<t> 

W 

(4>^T) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

Look  at  this  for  a  moment.  If  we  have  a  conditional  like  (P-»Q)  (looking  at  the  truth  table 
above,  remember  that  this  would  meant  that  we  let  be  P  and  VF  be  Q),  do  we  know 
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whether  any  other  sentence  is  true?  From  (P-»Q)  alone  we  do  not.  Even  if  (P-Q)  is  true, 
P  could  be  false  or  Q  could  be  false.  But  what  if  we  have  some  additional  information? 
Suppose  we  have  as  premises  both  (P-»Q)  and  P.  Then,  we  would  know  that  if  those 
premises  were  true,  Q  must  be  true.  We  have  already  checked  this  with  a  truth  table. 


premise 

premise 

p 

Q 

(P-Q) 

P 

Q 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

F 

F 

The  first  row  of  the  truth  table  is  the  only  row  where  all  of  the  premises  are  true;  and  for 
it,  we  find  that  Q  is  true.  This,  of  course,  generalizes  to  any  conditional.  That  is,  we  have 
that: 


premise 

premise 

<5 

W 

($-T) 

<t> 

W 

T 

T 

T 

T 

T 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

F 

F 

We  now  capture  this  insight  not  using  a  truth  table,  but  by  introducing  a  rule.  The  rule 
we  will  write  out  like  this: 

(<D-»'P) 

$ 


W 

This  is  a  syntactic  rule.  It  is  saying  that  whenever  we  have  written  down  a  formula  in  our 
language  that  has  the  shape  of  the  first  row  (that  is,  whenever  we  have  a  conditional),  and 
whenever  we  also  have  written  down  a  formula  that  has  the  shape  in  the  second  row  (that 
is,  whenever  we  also  have  written  down  the  antecedent  of  the  conditional),  then  go  ahead, 
whenever  you  like,  and  write  down  a  formula  like  that  in  the  third  row  (the  consequent  of 
the  conditional).  The  rule  talks  about  the  shape  of  the  formulas,  not  their  meaning.  But  of 
course  we  justified  the  rule  by  looking  at  the  meanings. 

We  describe  this  by  saying  that  the  third  line  is  “derived”  from  the  earlier  two  lines  using 
the  inference  rule. 
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This  inference  rule  is  old.  We  are,  therefore,  stuck  with  its  well-established,  but  not  very 
enlightening,  name:  “modus  ponens”.  Thus,  we  say,  for  the  above  example,  that  the  third 
line  is  derived  from  the  earlier  two  lines  using  modus  ponens. 

4.2  Direct  proof 

We  need  one  more  concept:  that  of  a  proof.  Specifically,  well  start  with  the  most 
fundamental  kind  of  proof,  which  is  called  a  “direct  proof”.  The  idea  of  a  direct  proof  is: 
we  write  down  as  numbered  lines  the  premises  of  our  argument.  Then,  after  this,  we  can 
write  down  any  line  that  is  justified  by  an  application  of  an  inference  rule  to  earlier  lines  in 
the  proof.  When  we  write  down  our  conclusion,  we  are  done. 

Let  us  make  a  proof  of  the  simple  argument  above,  which  has  premises  (P-»Q)  and  P,  and 
conclusion  Q.  We  start  by  writing  down  the  premises  and  numbering  them.  There  is 
a  useful  bit  of  notation  that  we  can  introduce  at  this  point.  It  is  known  as  a  “Fitch  bar”, 
named  after  a  logician  Frederic  Fitch,  who  developed  this  technique.  We  will  write  a 
vertical  bar  to  the  left,  with  a  horizontal  line  indicating  that  the  premises  are  above  the  line. 

1.  (P  -a  Q) 

2.  P 


It  is  also  helpful  to  identify  where  these  steps  came  from.  We  can  do  that  with  a  little 
explanation  written  out  to  the  right. 

1.  (P  — >•  Q)  premise 

2.  P  premise 


Now,  we  are  allowed  to  write  down  any  line  that  follows  from  an  earlier  line  using  an 
inference  rule. 

1.  (P  — »■  Q)  premise 

2.  P  premise 

3.  Q 

And,  finally,  we  want  a  reader  to  understand  what  rule  we  used,  so  we  add  that  into  our 
explanation,  identifying  the  rule  and  the  lines  used. 

1.  (P  -a  Q) 

2.  P 


|  3.  Q 

That  is  a  complete  direct  proof. 


premise 

premise 

modus  ponens,  1,  2 
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Notice  a  few  things.  The  numbering  of  each  line,  and  the  explanations  to  the  right, 
are  bookkeeping;  they  are  not  part  of  our  argument,  but  rather  are  used  to  explain  our 
argument.  However,  always  do  them  because,  it  is  hard  to  understand  a  proof  without 
them.  Also,  note  that  our  idea  is  that  the  inference  rule  can  be  applied  to  any  earlier  line, 
including  lines  themselves  derived  using  inference  rules.  It  is  not  just  premises  to  which  we 
can  apply  an  inference  rule.  Finally,  note  that  we  have  established  that  this  argument  must 
be  valid.  From  the  premises,  and  an  inference  rule  that  preserves  validity,  we  have  arrived 
at  the  conclusion.  Necessarily,  the  conclusion  is  true,  if  the  premises  are  true. 

The  long  argument  that  we  started  the  chapter  with  can  now  be  given  a  direct  proof. 


1.  p 

premise 

2.  (P  -A  Q) 

premise 

3.  (Q  -A  R) 

premise 

4.  (R  -a  S) 

premise 

5.  (S  -4  T) 

premise 

6.  (T  -a  U) 

premise 

7.  (V  -4  W) 

premise 

8.  (W  -4  X) 

premise 

9.  (X  -4  Y) 

premise 

10.  (Y  -a  X) 

premise 

11.  Q 

modus  ponens,  2,  1 

12.  R 

modus  ponens,  3,  11 

13.  S 

modus  ponens,  4,  12 

14.  T 

modus  ponens,  5,  13 

15.  U 

modus  ponens,  6,  14 

16.  V 

modus  ponens,  7,  15 

17.  W 

modus  ponens,  8,  16 

18.  X 

modus  ponens,  9,  17 

19.  Y 

modus  ponens,  10,  18 

From  repeated  applications  of  modus  ponens,  we  arrived  at  the  conclusion.  If  lines  1 
through  10  are  true,  line  19  must  be  true.  The  argument  is  valid.  And,  we  completed  it 
with  19  steps,  as  opposed  to  writing  out  1024  rows  of  a  truth  table. 

We  can  see  now  one  of  the  very  important  features  of  understanding  the  difference 
between  syntax  and  semantics.  Our  goal  is  to  make  the  syntax  of  our  language  perfectly 
mirror  its  semantics.  By  manipulating  symbols,  we  manage  to  say  something  about  the 
world.  This  is  a  strange  fact,  one  that  underlies  one  of  the  deeper  possibilities  of  language, 
and  also,  ultimately,  of  computers. 
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4.3  Other  inference  rules 

We  can  now  introduce  other  inference  rules.  Looking  at  the  truth  table  for  the  conditional 
again,  what  else  do  we  observe?  Many  have  noted  that  if  the  consequent  of  a  conditional 
is  false,  and  the  conditional  is  true,  then  the  antecedent  of  the  conditional  must  be  false. 
Written  out  as  a  semantic  check  on  arguments,  this  will  be: 


premise 

premise 

<t> 

W 

-AF 

-.$ 

T 

T 

T 

F 

F 

T 

F 

F 

T 

F 

F 

T 

T 

F 

T 

F 

F 

T 

T 

T 

(Remember  how  we  have  filled  out  the  truth  table.  We  referred  to  those  truth  tables  used 
to  define  and  “i”,  and  then  for  each  row  of  this  table  above,  we  filled  out  the  values  in 
each  column  based  on  that  definition.) 

What  we  observe  from  this  truth  table  is  that  when  both  ($^VF)  and  -AF  are  true,  then  -i$  is 
true.  Namely,  this  can  be  seen  in  the  last  row  of  the  truth  table. 

This  rule,  like  the  last,  is  old,  and  has  a  well-established  name:  “modus  tollens”.  We 
represent  it  schematically  with 

($^vp) 

-AF 


What  about  negation?  If  we  know  a  sentence  is  false,  then  this  fact  alone  does  not  tell  us 
about  any  other  sentence.  But  what  if  we  consider  a  negated  negation  sentence?  Such  a 
sentence  has  the  following  truth  table. 


<t> 

-1-1$ 

T 

T 

F 

F 

We  can  introduce  a  rule  that  takes  advantage  of  this  observation.  In  fact,  it  is  traditional 
to  introduce  two  rules,  and  lump  them  together  under  a  common  name.  The  rules’  name 
is  “double  negation”.  Basically,  the  rule  says  we  can  add  or  take  away  two  negations  any 
time.  Here  are  the  two  schemas  for  the  two  rules: 
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$ 


-1-1$ 


and 


-i-i$ 


$ 

Finally,  it  is  sometimes  helpful  to  be  able  to  repeat  a  line.  Technically,  this  is  an  unnecessary 
rule,  but  if  a  proof  gets  long,  we  often  find  it  easier  to  understand  the  proof  if  we  write  a 
line  over  again  later  when  we  find  we  need  it  again.  So  we  introduce  the  rule  “repeat”. 

$ 


$ 


4.4  An  example 

Here  is  an  example  that  will  make  use  of  all  three  rules.  Consider  the  following  argument: 

(Q-P) 

(-.Q-*R) 

iR 


P 

We  want  to  check  this  argument  to  see  if  it  is  valid. 

To  do  a  direct  proof,  we  number  the  premises  so  that  we  can  refer  to  them  when  using 
inference  rules. 
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1.  (Q  -A  P) 

2.  (-nQ  — >  R) 

3.  -iR 


premise 

premise 

premise 


And,  now,  we  apply  our  inference  rules.  Sometimes,  it  can  be  hard  to  see  how  to  complete 
a  proof.  In  the  worst  case,  where  you  are  uncertain  of  how  to  proceed,  you  can  apply  all  the 
rules  that  you  see  are  applicable  and  then,  assess  if  you  have  gotten  closer  to  the  conclusion; 
and  repeat  this  process.  Here  in  any  case  is  a  direct  proof  of  the  sought  conclusion. 


1.  (Q  ->  P) 

2.  (— i Q  -A  R) 

3.  -iR 


premise 

premise 

premise 


4.  — Q 

5.  Q 

6.  P 


modus  tollens,  2,  3 
double  negation,  4 
modus  ponens,  1,  5 


Developing  skill  at  completing  proofs  merely  requires  practice.  You  should  strive  to  do  as 
many  problems  as  you  can. 


4.5  Problems 


1.  Complete  a  direct  derivation  (also  called  a  “direct  proof”)  for  each  of  the  following 
arguments,  showing  that  it  is  valid.  You  will  need  the  rules  modus  ponens,  modus 
tollens,  and  double  negation. 

a.  Premises:  iQ,  (iQ^S).  Show:  S. 

b.  Premises:  (S  -  -.Q),  (P  -  S),  -P.  Show:  -iQ. 

c.  Premises:  (T  ->  P),  (Q  -  S),  (S  -  T),  -,P.  Show:  -iQ. 

d.  Premises:  R,  P,  (P  -»  (R  ->  Q)).  Show:  Q. 

e.  Premises:  ((R  -»  S)  -»  Q),  -,Q,  (-.(R  -  S)  ->  V).  Show:  V. 

f.  Premises:  (P->(Q-R)),n(Q-»R).  Show:  -iP. 

g.  Premises:  (->(Q  -»  R)  -fP),  ->P,  Q.  Show:  R. 

h.  Premises:  P,  (P  -  R),  (P  -  (R  -»  Q)).  Show:  Q. 


2.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 

premises.  Your  argument  should  just  be  a  paragraph  (not  an  ordered  list  of  sentences  or 
anything  else  that  looks  like  logic).  Translate  it  into  propositional  logic  and  use  a 
direct  proof  to  show  it  is  valid. 


3.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  three 
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premises.  Your  argument  should  just  be  a  paragraph  (not  an  ordered  list  of  sentences  or 
anything  else  that  looks  like  logic).  Translate  it  into  propositional  logic  and  use  a 
direct  proof  to  show  it  is  valid. 


4.  Make  your  own  key  to  translate  into  propositional  logic  the  portions  of  the  following 
argument  that  are  in  bold.  Using  a  direct  proof,  prove  that  the  resulting  argument  is 
valid. 


Inspector  Tarski  told  his  assistant,  Mr.  Carroll,  “If  Wittgenstein  had  mud  on 
his  boots,  then  he  was  in  the  field.  Furthermore,  if  Wittgenstein  was  in  the 
field,  then  he  is  the  prime  suspect  for  the  murder  of  Dodgson.  Wittgenstein 
did  have  mud  on  his  boots.  We  conclude,  Wittgenstein  is  the  prime  suspect 
for  the  murder  of  Dodgson.” 


5.  "And" 


5.1  The  conjunction 

To  make  our  logical  language  more  easy  and  intuitive  to  use,  we  can  now  add  to  it  elements 
that  make  it  able  to  express  the  equivalents  of  other  sentences  from  a  natural  language  like 
English.  Our  translations  will  not  be  exact,  but  they  will  be  close  enough  that:  first,  we  will 
have  a  way  to  more  quickly  understand  the  language  we  are  constructing;  and,  second,  we 
will  have  a  way  to  speak  English  more  precisely  when  that  is  required  of  us. 

Consider  the  following  expressions.  How  would  we  translate  them  into  our  logical 
language? 

Tom  will  go  to  Berlin  and  Paris. 

The  number  a  is  evenly  divisible  by  2  and  3. 

Steve  is  from  Texas  but  not  from  Dallas. 

We  could  translate  each  of  these  using  an  atomic  sentence.  But  then  we  would  have 
lost — or  rather  we  would  have  hidden — information  that  is  clearly  there  in  the  English 
sentences.  We  can  capture  this  information  by  introducing  a  new  connective;  one  that 
corresponds  to  our  “and”. 

To  see  this,  consider  whether  you  will  agree  that  these  sentences  above  are  equivalent  to 
the  following  sentences. 

Tom  will  go  to  Berlin  and  Tom  will  go  to  Paris. 

The  number  a  is  evenly  divisible  by  2  and  the  number  a  is  evenly  divisible  by  3. 

Steve  is  from  Texas  and  it  is  not  the  case  that  Steve  is  from  Dallas. 

Once  we  grant  that  these  sentences  are  equivalent  to  those  above,  we  see  that  we  can  treat 
the  “and”  in  each  sentence  as  a  truth  functional  connective. 

Suppose  we  assume  the  following  key. 

P:  Tom  will  go  to  Berlin. 

Q:  Tom  will  go  to  Paris. 

R:  a  is  evenly  divisible  by  2. 

S:  a  is  evenly  divisible  by  3. 

T:  Steve  is  from  Texas 
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U:  Steve  is  from  Dallas. 

A  partial  translation  of  these  sentences  would  then  be: 

P  and  Q 
R  and  S 

T  and  -.U 

Our  third  sentence  above  might  generate  some  controversy.  How  should  we  understand 
“but”?  Consider  that  in  terms  of  the  truth  value  of  the  connected  sentences,  “but”  is  the 
same  as  “and”.  That  is,  if  you  say  “P  but  Q”  you  are  asserting  that  both  P  and  Q  are  true. 
However,  in  English  there  is  extra  meaning;  the  English  “but”  seems  to  indicate  that  the 
additional  sentence  is  unexpected  or  counter-intuitive.  “P  but  Q”  seems  to  say,  “P  is  true, 
and  you  will  find  it  surprising  or  unexpected  that  Q  is  true  also.”  That  extra  meaning  is  lost 
in  our  logic.  We  will  not  be  representing  surprise  or  expectations.  So,  we  can  treat  “but”  as 
being  the  same  as  “and”.  This  captures  the  truth  value  of  the  sentence  formed  using  “but”, 
which  is  all  that  we  require  of  our  logic. 

Following  our  method  up  until  now,  we  want  a  symbol  to  stand  for  “and”.  In  recent  years 
the  most  commonly  used  symbol  has  been  “A”. 

The  syntax  for  “A”  is  simple.  If  $  and  VF  are  sentences,  then 

($Aq/) 

is  a  sentence.  Our  translations  of  our  three  example  sentences  should  thus  look  like  this: 

(PAQ) 

(RAS) 

(TA-iU) 

Each  of  these  is  called  a  “conjunction”.  The  two  parts  of  a  conjunction  are  called 
“conjuncts”. 

The  semantics  of  the  conjunction  are  given  by  its  truth  table.  Most  people  find  the 
conjunction’s  semantics  obvious.  If  I  claim  that  both  $  and  lE  are  true,  normal  usage 
requires  that  if  $  is  false  or  W  is  false,  or  both  are  false,  then  I  spoke  falsely  also. 

Consider  an  example.  Suppose  your  employer  says,  “After  one  year  of  employment  you 
will  get  a  raise  and  two  weeks  vacation”.  A  year  passes.  Suppose  now  that  this  employer 
gives  you  a  raise  but  no  vacation,  or  a  vacation  but  no  raise,  or  neither  a  raise  nor  a  vacation. 
In  each  case,  the  employer  has  broken  his  promise.  The  sentence  forming  the  promise 
turned  out  to  be  false. 
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Thus,  the  semantics  for  the  conjunction  are  given  with  the  following  truth  table.  For  any 
sentences  $  and  W: 


T 

($AT) 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

F 

5.2  Alternative  phrasings,  and  a  different  "and" 

We  have  noted  that  in  English,  “but”  is  an  alternative  to  “and”,  and  can  be  translated  the 
same  way  in  our  propositional  logic.  There  are  other  phrases  that  have  a  similar  meaning: 
they  are  best  translated  by  conjunctions,  but  they  convey  (in  English)  a  sense  of  surprise  or 
failure  of  expectations.  For  example,  consider  the  following  sentence. 

Even  though  they  lost  the  battle,  they  won  the  war. 

Here  “even  though”  seems  to  do  the  same  work  as  “but”.  The  implication  is  that  it  is 
surprising — that  one  might  expect  that  if  they  lost  the  battle  then  they  lost  the  war.  But,  as 
we  already  noted,  we  will  not  capture  expectations  with  our  logic.  So,  we  would  take  this 
sentence  to  be  sufficiently  equivalent  to: 

They  lost  the  battle  and  they  won  the  war. 

With  the  exception  of  “but”,  it  seems  in  English  there  is  no  other  single  word  that  is  an 
alternative  to  “and”  that  means  the  same  thing.  However,  there  are  many  ways  that  one 
can  imply  a  conjunction.  To  see  this,  consider  the  following  sentences. 

Tom,  who  won  the  race,  also  won  the  championship. 

The  star  Phosphorous,  that  we  see  in  the  morning,  is  the  Evening  Star. 

The  Evening  Star,  which  is  called  “Hesperus”,  is  also  the  Morning  Star. 

While  Steve  is  tall,  Tom  is  not. 

Dogs  are  vertebrate  terrestrial  mammals. 

Depending  on  what  elements  we  take  as  basic  in  our  language,  these  sentences  all  include 
implied  conjunctions.  They  are  equivalent  to  the  following  sentences,  for  example: 

Tom  won  the  race  and  Tom  won  the  championship. 

Phosphorous  is  the  star  that  we  see  in  the  morning  and  Phosphorous  is  the 
Evening  Star. 

The  Evening  Star  is  called  “Hesperus”  and  the  Evening  Star  is  the  Morning  Star. 
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Steve  is  tall  and  it  is  not  the  case  that  Tom  is  tall. 

Dogs  are  vertebrates  and  dogs  are  terrestrial  and  dogs  are  mammals. 

Thus,  we  need  to  be  sensitive  to  complex  sentences  that  are  conjunctions  but  that  do  not 
use  “and”  or  “but”  or  phrases  like  “even  though”. 

Unfortunately,  in  English  there  are  some  uses  of  “and”  that  are  not  conjunctions.  The  same 
is  true  for  equivalent  terms  in  some  other  natural  languages.  Here  is  an  example. 

Rochester  is  between  Buffalo  and  Albany. 

The  “and”  in  this  sentence  is  not  a  conjunction.  To  see  this,  note  that  this  sentence  is  not 
equivalent  to  the  following: 

Rochester  is  between  Buffalo  and  Rochester  is  between  Albany. 

That  sentence  is  not  even  semantically  correct.  What  is  happening  in  the  original  sentence? 

The  issue  here  is  that  “is  between”  is  what  we  call  a  “predicate”.  We  will  learn  about 
predicates  in  chapter  11,  but  what  we  can  say  here  is  that  some  predicates  take  several  names 
in  order  to  form  a  sentence.  In  English,  if  a  predicate  takes  more  than  two  names,  then 
we  typically  use  the  “and”  to  combine  names  that  are  being  described  by  that  predicate.  In 
contrast,  the  conjunction  in  our  propositional  logic  only  combines  sentences.  So,  we  must 
say  that  there  are  some  uses  of  the  English  “and”  that  are  not  equivalent  to  our  conjunction. 

This  could  be  confusing  because  sometimes  in  English  we  put  “and”  between  names  and 
there  is  an  implied  conjunction.  Consider: 

Steve  is  older  than  Joe  and  Karen. 

Superficially,  this  looks  to  have  the  same  structure  as  “Rochester  is  between  Buffalo  and 
Albany”.  But  this  sentence  really  is  equivalent  to: 

Steve  is  older  than  Joe  and  Steve  is  older  than  Karen. 

The  difference,  however,  is  that  there  must  be  three  things  in  order  for  one  to  be  between 
the  other  two.  There  need  only  be  two  things  for  one  to  be  older  than  the  other.  So,  in  the 
sentence  “Rochester  is  between  Buffalo  and  Albany”,  we  need  all  three  names  (“Rochester”, 
“Buffalo”,  and  “Albany)  to  make  a  single  proper  atomic  sentence  with  “between”.  This  tells 
us  that  the  “and”  is  just  being  used  to  combine  these  names,  and  not  to  combine  implied 
sentences  (since  there  can  be  no  implied  sentence  about  what  is  “between”,  using  just  two 
or  just  one  of  these  names). 

That  sounds  complex.  Do  not  despair,  however.  The  use  of  “and”  to  identify  names  being 
used  by  predicates  is  less  common  than  “and”  being  used  for  a  conjunction.  Also,  after  we 
discuss  predicates  in  chapter  11,  and  after  you  have  practiced  translating  different  kinds  of 
sentences,  the  distinction  between  these  uses  of  “and”  will  become  easy  to  identify  in  almost 
all  cases.  In  the  meantime,  we  shall  pick  examples  that  do  not  invite  this  confusion. 
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5.3  Inference  rules  for  conjunctions 

Looking  at  the  truth  table  for  the  conjunction  should  tell  us  two  things  very  clearly.  First, 
if  a  conjunction  is  true,  what  else  must  be  true?  The  obvious  answer  is  that  both  of  the 
parts,  the  conjuncts,  must  be  true.  We  can  introduce  a  rule  to  capture  this  insight.  In  fact, 
we  can  introduce  two  rules  and  call  them  by  the  same  name,  since  the  order  of  conjuncts 
does  not  affect  their  truth  value.  These  rules  are  often  called  “simplification”. 

($AVJ/) 


$ 


And: 


W 

In  other  words,  if  (4>AVE)  is  true,  then  $  must  be  true;  and  if  (5>A'T)  is  true,  then  W  must  be 
true. 

We  can  also  introduce  a  rule  to  show  a  conjunction,  based  on  what  we  see  from  the  truth 
table.  That  is,  it  is  clear  that  there  is  only  one  kind  of  condition  in  which  (<f>AVF)  is  true, 
and  that  is  when  $  is  true  and  when  W  is  true.  This  suggests  the  following  rule: 

$ 

W 


($A^T) 

We  might  call  this  rule  “conjunction”,  but  to  avoid  confusion  with  the  name  of  the 
sentences,  we  will  call  this  rule  “adjunction”. 

5.4  Reasoning  with  conjunctions 

It  would  be  helpful  to  consider  some  examples  of  reasoning  with  conjunctions.  Let’s  begin 
with  an  argument  in  a  natural  language. 

Tom  and  Steve  will  go  to  London.  If  Steve  goes  to  London,  then  he  will  ride  the 
Eye.  Tom  will  ride  the  Eye  too,  provided  that  he  goes  to  London.  So,  both  Steve 
and  Tom  will  ride  the  Eye. 


We  need  a  translation  key. 
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T:  Tom  will  go  to  London. 
S:  Steve  will  go  to  London. 
U:  Tom  will  ride  the  Eye. 
V :  Steve  will  ride  the  Eye. 
Thus  our  argument  is: 

(TAS) 

(S-U) 

(T-V) 


(VAU) 

Our  direct  proof  will  look  like  this. 


1.  (T  AS) 

premise 

2.  (S  U) 

premise 

3.  (T  -a  V) 

premise 

4.  T 

simplification,  1 

5.  V 

modus  ponens,  3,  4 

6.  S 

simplification,  1 

7.  U 

modus  ponens,  2,  6 

8.  (VAU) 

adjunction,  5,  7 

Now  an  example  using  just  our  logical  language.  Consider  the  following  argument. 

(Q— s) 

(P-(QAR)) 

(T— R) 

P 


(-iSAiT) 
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Here  is  one  possible  proof. 


1.  (Q  -a  -S) 

premise 

2.  (P  -a  (Q  A  R)) 

premise 

3.  (T  — >  -iR) 

premise 

4.  P 

premise 

5.  (Q  A  R) 

modus  ponens,  2,  4 

6.  Q 

simplification,  5 

7.  — 'S 

modus  ponens,  1,  6 

8.  R 

simplification,  5 

9.  — -R 

double  negation,  8 

10.  -iT 

modus  tollens,  3,  9 

11.  (— i 'S  A  - 1 T) 

adjunction,  7,  10 

5.5  Alternative  symbolizations  for  the  conjunction 

Alternative  notations  for  the  conjunction  include  the  symbols  and  the  symbol  “ 
Thus,  the  expression  (PAQ)  would  be  written  in  these  different  styles,  as: 

(P&Q) 

(P-Q) 


5.6  Complex  sentences 

Now  that  we  have  three  different  connectives,  this  is  a  convenient  time  to  consider 
complex  sentences.  The  example  that  we  just  considered  required  us  to  symbolize  complex 
sentences,  which  use  several  different  kinds  of  connectives.  We  want  to  avoid  confusion 
by  being  clear  about  the  nature  of  these  sentences.  We  also  want  to  be  able  to  understand 
when  such  sentences  are  true  and  when  they  are  false.  These  two  goals  are  closely  related. 

Consider  the  following  sentences. 

-(p-Q) 

(-P-Q) 

(-P— Q) 

We  want  to  understand  what  kinds  of  sentences  these  are,  and  also  when  they  are  true 
and  when  they  are  false.  (Sometimes  people  wrongly  assume  that  there  is  some  simple 
distribution  law  for  negation  and  conditionals,  so  there  is  some  additional  value  to 
reviewing  these  particular  examples.)  The  first  task  is  to  determine  what  kinds  of  sentences 
these  are.  If  the  first  symbol  of  your  expression  is  a  negation,  then  you  know  the  sentence  is 
a  negation.  The  first  sentence  above  is  a  negation.  If  the  first  symbol  of  your  expression  is 
a  parenthesis,  then  for  our  logical  language  we  know  that  we  are  dealing  with  a  connective 
that  combines  two  sentences. 
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The  way  to  proceed  is  to  match  parentheses.  Generally  people  are  able  to  do  this  by  eye, 
but  if  you  are  not,  you  can  use  the  following  rule.  Moving  left  to  right,  the  last  “(”  that  you 
encounter  always  matches  the  first  “)”  that  you  encounter.  These  form  a  sentence  that  must 
have  two  parts  combined  with  a  connective.  You  can  identify  the  two  parts  because  each 
will  be  an  atomic  sentence,  a  negation  sentence,  or  a  more  complex  sentence  bound  with 
parentheses  on  each  side  of  the  connective. 

In  our  propositional  logic,  each  set  of  paired  parentheses  forms  a  sentence  of  its  own.  So, 
when  we  encounter  a  sentence  that  begins  with  a  parenthesis,  we  find  that  if  we  match 
the  other  parentheses,  we  will  ultimately  end  up  with  two  sentences  as  constituents,  one 
on  each  side  of  a  single  connective.  The  connective  that  combines  these  two  parts  is  called 
the  “main  connective”,  and  it  tells  us  what  kind  of  sentence  this  is.  Thus,  above  we  have 
examples  of  a  negation,  a  conditional,  and  a  conditional. 

How  should  we  understand  the  meaning  of  these  sentences?  Here  we  can  use  truth  tables 
in  a  new,  third  way  (along  with  defining  a  connective  and  checking  arguments).  Our 
method  will  be  this. 

First,  write  out  the  sentence  on  the  right,  leaving  plenty  of  room.  Identify  what  kind  of 
sentence  this  is.  If  it  is  a  negation  sentence,  you  should  add  just  to  the  left  a  column  for 
the  non-negated  sentence.  This  is  because  the  truth  table  defining  negation  tells  us  what  a 
negated  sentence  means  in  relation  to  the  non-negated  sentence  that  forms  the  sentence.  If 
the  sentence  is  a  conditional,  make  two  columns  to  the  left,  one  for  the  antecedent  and  one 
for  the  consequent.  If  the  sentence  is  a  conjunction,  make  two  columns  to  the  left,  one  for 
each  conjunct.  Here  again,  we  do  this  because  the  semantic  definitions  of  these  connectives 
tell  us  what  the  truth  value  of  the  sentence  is,  as  a  function  of  the  truth  value  of  its  two 
parts.  Continue  this  process  until  the  parts  would  be  atomic  sentences.  Then,  we  stipulate 
all  possible  truth  values  for  the  atomic  sentences.  Once  we  have  done  this,  we  can  fill  out 
the  truth  table,  working  left  to  right. 

Let’s  try  it  for  -i(P->Q).  We  write  it  to  the  right. 


fyP-Q) 


This  is  a  negation  sentence,  so  we  write  to  the  left  the  sentence  being  negated. 


(P-Q) 

fyP-Q) 
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This  sentence  is  a  conditional.  Its  two  parts  are  atomic  sentences.  We  put  these  to  the 
left  of  the  dividing  line,  and  we  stipulate  all  possible  combinations  of  truth  values  for  these 
atomic  sentences. 


p 

Q 

(P-Q) 

-,(p_>Q) 

T 

T 

T 

F 

F 

T 

F 

F 

Now,  we  can  fill  out  each  column,  moving  left  to  right.  We  have  stipulated  the  values  for 
P  and  Q,  so  we  can  identify  the  possible  truth  values  of  (P->Q).  The  semantic  definition 
for  tells  us  how  to  do  that,  given  that  we  know  for  each  row  the  truth  value  of  its  parts. 


p 

<2 

(P-Q) 

-(P-Q) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

This  column  now  allows  us  to  fill  in  the  last  column.  The  sentence  in  the  last  column  is  a 
negation  of(P-Q)  ,  so  the  definition  of  “T’  tell  us  that  -i  (P-Q)  is  true  when  (P-Q)  is  false, 
and  -,(P_>Q)  is  false  when  (P-Q)  is  true. 


P 

Q 

(P- 

♦Q) 

-dP-Q) 

T 

T 

T 

F 

T 

F 

F 

T 

F 

T 

T 

F 

F 

F 

T 

F 

This  truth  table  tells  us  what  -1(P_>Q)  means  in  our  propositional  logic.  Namely,  if  we 
assert  ->  (P-Q)  we  are  asserting  that  P  is  true  and  Q  is  false. 

We  can  make  similar  truth  tables  for  the  other  sentences. 


P 

Q 

-.P 

(-.P— Q) 

T 

T 

F 

T 

T 

F 

F 

T 

F 

T 

T 

T 

F 

F 

T 

F 
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How  did  we  make  this  table?  The  sentence  (-P-Q)  is  a  conditional  with  two  parts, 
-iP  and  Q.  Because  Q  is  atomic,  it  will  be  on  the  left  side.  We  make  a  row  for  -iP.  The 
sentence  -iP  is  a  negation  of  P,  which  is  atomic,  so  we  put  P  also  on  the  left.  We  fill  in  the 
columns,  going  left  to  right,  using  our  definitions  of  the  connectives. 

And: 


p 

£ 

-.P 

ffP— Q) 

T 

T 

F 

— 

F 

T 

T 

F 

F 

T 

T 

F 

T 

T 
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F 

F 

F 

T 

T 

T 

Such  a  truth  table  is  very  helpful  in  determining  when  sentences  are,  and  are  not, 
equivalent.  We  have  used  the  concept  of  equivalence  repeatedly,  but  have  not  yet  defined 
it.  We  can  offer  a  semantic,  and  a  syntactic,  explanation  of  equivalence.  The  semantic 
notion  is  relevant  here:  we  say  two  sentences  $  and  T7  are  “equivalent”  or  “logically 
equivalent”  when  they  must  have  the  same  truth  value.  (For  the  syntactic  concept  of 
equivalence,  see  section  9.2).  These  truth  tables  show  that  these  three  sentences  are  not 
equivalent,  because  it  is  not  the  case  that  they  must  have  the  same  truth  value.  For  example, 
if  P  and  Q  are  both  true,  then  -(P-Q)  is  false  but  (-iP-»Q)  is  true  and  (-.P-»-.Q)  is  true. 
If  P  is  false  and  Q  is  true,  then  (-iP->Q)  is  true  but  (-.P-»-.Q)  is  false.  Thus,  each  of  these 
sentences  is  true  in  some  situation  where  one  of  the  others  is  false.  No  two  of  them  are 
equivalent. 

We  should  consider  an  example  that  uses  conjunction,  and  which  can  help  in  some 
translations.  How  should  we  translate  “Not  both  Steve  and  Tom  will  go  to  Berlin”?  This 
sentence  tells  us  that  it  is  not  the  case  that  both  Steve  will  go  to  Berlin  and  Tom  will  go 
to  Berlin.  The  sentence  does  allow,  however,  that  one  of  them  will  go  to  Berlin.  Thus, 
let  U  mean  Sieve  will  go  to  Berlin  and  V  mean  Tom  will  go  to  Berlin.  Then  we  should 
translate  this  sentence,  i(UAV).  We  should  not  translate  the  sentence  (iUAiV).  To  see 
why,  consider  their  truth  tables. 


U 

V 

C 

> 

< 

-.(UAV) 

-.u 

W 

J 

c 

> 
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T 

T 
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We  can  see  that  i(UAV)  and  (iUAiV)  are  not  equivalent.  Also,  note  the  following.  Both 
i(UAV)  and  (iUA-iV)  are  true  if  Steve  does  not  go  to  Berlin  and  Tom  does  not  go  to  Berlin. 
This  is  captured  in  the  last  row  of  this  truth  table,  and  this  is  consistent  with  the  meaning 
of  the  English  sentence.  But,  now  note:  it  is  true  that  not  both  Steve  and  Tom  will  go  to 
Berlin,  if  Steve  goes  and  Tom  does  not.  This  is  captured  in  the  second  row  of  this  truth 
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table.  It  is  true  that  not  both  Steve  and  Tom  will  go  to  Berlin,  if  Steve  does  not  go  but 
Tom  does.  This  is  captured  in  the  third  row  of  this  truth  table.  In  both  kinds  of  cases  (in 
both  rows  of  the  truth  table),  i(UAV)  is  true  but  (iUAiV)  is  false.  Thus,  we  can  see  that 
-(UAV)  is  the  correct  translation  of  “Not  both  Steve  and  Tom  will  go  to  Berlin”. 

Let’s  consider  a  more  complex  sentence  that  uses  all  of  our  connectives  so  far: 
((PA^Q)^(P->Q)).  This  sentence  is  a  conditional.  The  antecedent  is  a  conjunction.  The 
consequent  is  a  negation.  Here  is  the  truth  table,  completed. 


p 

Q 

-Q 

(p-Q) 

1 3 
> 
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O 

-’(p-Q) 

((pa-Q)_(p-Q)) 
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This  sentence  has  an  interesting  property:  it  cannot  be  false.  That  is  not  surprising,  once 
we  think  about  what  it  says.  In  English,  the  sentence  says:  If  P  is  true  and  Q  is  false,  then 
it  is  not  the  case  that  P  implies  Q.  That  must  be  true:  if  it  were  the  case  that  P  implies  Q, 
then  if  P  is  true  then  Q  is  true.  But  the  antecedent  says  P  is  true  and  Q  is  false. 

Sentences  of  the  propositional  logic  that  must  be  true  are  called  “tautologies”.  We  will 
discuss  them  at  length  in  later  chapters. 

Finally,  note  that  we  can  combine  this  method  for  finding  the  truth  conditions  for  a 
complex  sentence  with  our  method  for  determining  whether  an  argument  is  valid  using  a 
truth  table.  We  will  need  to  do  this  if  any  of  our  premises  or  the  conclusion  are  complex. 
Here  is  an  example.  Well  start  with  an  argument  in  English: 

If  whales  are  mammals,  then  they  have  vestigial  limbs.  If  whales  are  mammals, 
then  they  have  a  quadrupedal  ancestor.  Therefore,  if  whales  are  mammals  then 
they  have  a  quadrupedal  ancestor  and  they  have  vestigial  limbs. 

We  need  a  translation  key. 

P:  Whales  are  mammals. 

Q:  Whales  have  have  vestigial  limbs. 

R:  Whales  have  a  quadrupedal  ancestor. 


The  argument  will  then  be  symbolized  as: 
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(P-Q) 

(P-R) 


(P-(RAQ)) 


Here  is  a  semantic  check  of  the  argument. 


premise 

premise 

conclusion 

p 

Q 

R 

(P-Q) 

(P-R) 
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We  have  highlighted  the  rows  where  the  premises  are  all  true.  Note  that  for  these,  the 
conclusion  is  true.  Thus,  in  any  kind  of  situation  in  which  all  the  premises  are  true,  the 
conclusion  is  true.  This  is  equivalent,  we  have  noted,  to  our  definition  of  valid:  necessarily, 
if  all  the  premises  are  true,  the  conclusion  is  true.  So  this  is  a  valid  argument.  The  third 
column  of  the  analyzed  sentences  (the  column  for  (RAQ))  is  there  so  that  we  can  identify 
when  the  conclusion  is  true.  The  conclusion  is  a  conditional,  and  we  needed  to  know,  for 
each  kind  of  situation,  if  its  antecedent  P,  and  if  its  consequent  (RAQ),  are  true.  The  third 
column  tells  us  the  situations  in  which  the  consequent  is  true.  The  stipulations  on  the  left 
tell  us  in  what  kind  of  situation  the  antecedent  P  is  true. 

5.6  Problems 

1.  Translate  the  following  sentences  into  our  logical  language.  You  will  need  to  create 
your  own  key  to  do  so. 

a.  Ulysses,  who  is  crafty,  is  from  Ithaca. 

b.  If  Ulysses  outsmarts  both  Circes  and  the  Cyclops,  then  he  can  go  home. 

c.  Ulysses  can  go  home  only  if  he  isn’t  from  Troy. 

d.  Ulysses  is  from  Ithaca  but  not  from  Troy. 

e.  Ulysses  is  not  both  crafty  and  from  Ithaca. 

2.  Prove  the  following  arguments  are  valid,  using  a  direct  derivation. 
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a.  Premise:  ((P-Q)  -  iQ).  Conclusion:  -iP. 

b.  Premises:  ((P-Q)  "  (R-S)),  (-.Q  *  -nS).  Conclusion:  (M>  -  -hR). 

c.  Premises:  ((R  A  S)  -  T),  (Q  *  -.T).  Conclusion:  -(R  A  s). 

d.  Premises:  (P  ->  (R  ->  S)),  (R  -  P).  Conclusion:  S. 

e.  Premises:  (P  ->  (R  -  S)),  (iS  -  P).  Conclusion:  -iR. 

3.  Make  truth  tables  for  the  following  complex  sentences.  Identify  which  are  tautologies. 

a.  (((P-Q)A--QH-.P) 

b.  -i(P  A  Q) 

c.  -i(-,P  ~ >-,Q) 

d.  (PAM>) 

e.  -^(PA-.P) 

4.  Make  truth  tables  to  show  when  the  following  sentences  are  true  and  when  they  are 
false.  State  which  of  these  sentences  are  equivalent. 

a.  ->(PAQ) 

b.  (iPAiQ) 

c.  ->(P-*Q) 

d.  (PAiQ) 

e.  (iPAQ) 

f.  ->(-'P->-'Q) 


5.  Write  a  valid  argument  in  normal  colloquial  English  with  at  least  two  premises,  one  of 
which  is  a  conjunction  or  includes  a  conjunction.  Your  argument  should  just  be  a 
paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like  formal 
logic).  Translate  the  argument  into  propositional  logic.  Prove  it  is  valid. 


6.  Write  a  valid  argument  in  normal  colloquial  English  with  at  least  three  premises,  one  of 
which  is  a  conjunction  or  includes  a  conjunction  and  one  of  which  is  a  conditional  or 
includes  a  conditional.  Translate  the  argument  into  propositional  logic.  Prove  it  is 
valid. 


7.  Make  your  own  key  to  translate  the  following  argument  into  our  propositional  logic. 
Translate  only  the  parts  in  bold.  Prove  the  argument  is  valid. 

“I  suspect  Dr.  Kronecker  of  the  crime  of  stealing  Cantor’s  book,”  Inspector  Tarski 
said.  His  assistant,  Mr.  Carroll,  waited  patiently  for  his  reasoning.  “For,”  Tarski 
said,  “The  thief  left  cigarette  ashes  on  the  table.  The  thief  also  did  not  wear  shoes, 
but  slipped  silently  into  the  room.  Thus,  If  Dr.  Kronecker  smokes  and  is  in  his 
stocking  feet,  then  he  most  likely  stole  Cantor’s  book.”  At  this  point,  Tarski 
pointed  at  Kronecker’s  feet.  “Dr.  Kronecker  is  in  his  stocking  feet.”  Tarski 
reached  forward  and  pulled  from  Kronecker’s  pocket  a  gold  cigarette  case.  “And 
Kronecker  smokes.”  Mr.  Carroll  nodded  sagely,  “Your  conclusion  is  obvious: 
Dr.  Kronecker  most  likely  stole  Cantor’s  book.” 
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6.1  An  argument  from  Hobbes 

In  his  great  work,  Leviathan,  the  philosopher  Thomas  Hobbes  (1588-1679)  gives  an 
important  argument  for  government.  Hobbes  begins  by  claiming  that  without  a  common 
power,  our  condition  is  very  poor  indeed.  He  calls  this  state  without  government,  “the  state 
of  nature”,  and  claims 

Hereby  it  is  manifest  that  during  the  time  men  live  without  a  common  power  to 
keep  them  all  in  awe,  they  are  in  that  condition  which  is  called  war;  and  such  a  war  as 
is  of  every  man  against  every  man. ...  In  such  condition  there  is  no  place  for  industry, 
because  the  fruit  thereof  is  uncertain:  and  consequently  no  culture  of  the  earth;  no 
navigation,  nor  use  of  the  commodities  that  may  be  imported  by  sea;  no  commodious 
building;  no  instruments  of  moving  and  removing  such  things  as  require  much  force; 
no  knowledge  of  the  face  of  the  earth;  no  account  of  time;  no  arts;  no  letters;  no 
society;  and  which  is  worst  of  all,  continual  fear,  and  danger  of  violent  death;  and  the 
life  of  man,  solitary,  poor,  nasty,  brutish,  and  short. M 

Hobbes  developed  what  is  sometimes  called  “contract  theory”.  This  is  a  view  of 
government  in  which  one  views  the  state  as  the  product  of  a  rational  contract.  Although 
we  inherit  our  government,  the  idea  is  that  in  some  sense  we  would  find  it  rational  to 
choose  the  government,  were  we  ever  in  the  position  to  do  so.  So,  in  the  passage  above, 
Hobbes  claims  that  in  this  state  of  nature,  we  have  absolute  freedom,  but  this  leads  to 
universal  struggle  between  all  people.  There  can  be  no  property,  for  example,  if  there  is  no 
power  to  enforce  property  rights.  You  are  free  to  take  other  people’s  things,  but  they  are 
also  free  to  take  yours.  Only  violence  can  discourage  such  theft.  But,  a  common  power, 
like  a  king,  can  enforce  rules,  such  as  property  rights.  To  have  this  common  power,  we 
must  give  up  some  freedoms.  You  are  (or  should  be,  if  it  were  ever  up  to  you)  willing  to 
give  up  those  freedoms  because  of  the  benefits  that  you  get  from  this.  For  example,  you  are 
willing  to  give  up  the  freedom  to  just  seize  people’s  goods,  because  you  like  even  more  that 
other  people  cannot  seize  your  goods. 

We  can  reconstruct  Hobbes’s  defense  of  government,  greatly  simplified,  as  being 
something  like  this: 

If  we  want  to  be  safe,  then  we  should  have  a  state  that  can  protect  us. 

If  we  should  have  a  state  that  can  protect  us,  then  we  should  give  up  some 

freedoms. 

Therefore,  if  we  want  to  be  safe,  then  we  should  give  up  some  freedoms. 

Let  us  use  the  following  translation  key. 

P:  We  want  to  be  safe. 
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Q:  We  should  have  a  state  that  can  protect  us. 

R:  We  should  give  up  some  freedoms. 

The  argument  in  our  logical  language  would  then  be: 

(P-Q) 

(Q-R) 

(P-R) 

This  is  a  valid  argument.  Let’s  take  the  time  to  show  this  with  a  truth  table. 


The  rows  in  which  all  the  premises  are  true  are  the  first,  fifth,  seventh,  and  eighth  rows. 
Note  that  in  each  such  row,  the  conclusion  is  true.  Thus,  in  any  kind  of  situation  where 
the  premises  are  true,  the  conclusion  is  true.  This  is  our  semantics  for  a  valid  argument. 

What  syntactic  method  can  we  use  to  prove  this  argument  is  valid?  Right  now,  we  have 
none.  Other  than  double  negation,  we  cannot  even  apply  any  of  our  inference  rules  using 
these  premises. 

Some  logic  systems  introduce  a  rule  to  capture  this  inference;  this  rule  is  typically  called  the 
“chain  rule”.  But,  there  is  a  more  general  principle  at  stake  here:  we  need  a  way  to  show 
conditionals.  So  we  want  to  take  another  approach  to  showing  this  argument  is  valid. 

6.2  Conditional  derivation 

As  a  handy  rule  of  thumb,  we  can  think  of  the  inference  rules  as  providing  a  way  to  either 
show  a  kind  of  sentence,  or  to  make  use  of  a  kind  of  sentence.  For  example,  adjunction 
allows  us  to  show  a  conjunction.  Simplification  allows  us  to  make  use  of  a  conjunction. 
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But  this  pattern  is  not  complete:  we  have  rules  to  make  use  of  a  conditional  (modus  ponens 
and  modus  tollens),  but  no  rule  to  show  a  conditional. 

We  will  want  to  have  some  means  to  prove  a  conditional,  because  sometimes  an  argument 
will  have  a  conditional  as  a  conclusion.  It  is  not  clear  what  rule  we  should  introduce, 
however.  The  conditional  is  true  when  the  antecedent  is  false,  or  if  both  the  antecedent 
and  the  consequent  are  true.  That’s  a  rather  messy  affair  for  making  an  inference  rule. 

However,  think  about  what  the  conditional  asserts:  if  the  antecedent  is  true,  then  the 
consequent  is  true.  We  can  make  use  of  this  idea  not  with  an  inference  rule,  but  rather  in 
the  very  structure  of  a  proof.  We  treat  the  proof  as  embodying  a  conditional  relationship. 

Our  idea  is  this:  let  us  assume  some  sentence,  $>.  If  we  can  then  prove  another  sentence  W, 
we  will  have  proved  that  if  $  is  true  then  W  is  true.  The  proof  structure  will  thus  have  a 
shape  like  this: 


> 


(<P  ->•  vo 


The  last  line  of  the  proof  is  justified  by  the  shape  of  the  proof:  by  assuming  that  $  is  true, 
and  then  using  our  inference  rules  to  prove  W,  we  know  that  if  $  is  true  then  W  is  true. 
And  this  is  just  what  the  conditional  asserts. 

This  method  is  sometimes  referred  to  as  an  application  of  the  deduction  theorem.  In 
chapter  17  we  will  prove  the  deduction  theorem.  Here,  instead,  we  shall  think  of  this  as  a 
proof  method,  traditionally  called  “conditional  derivation”. 

A  conditional  derivation  is  like  a  direct  derivation,  but  with  two  differences.  First,  along 
with  the  premises,  you  get  a  single  special  assumption,  called  “the  assumption  for 
conditional  derivation”.  Second,  you  do  not  aim  to  show  your  conclusion,  but  rather  the 
consequent  of  your  conclusion.  So,  to  show  you  will  always  assume  $  and  try  to 

show  W.  Also,  in  our  logical  system,  a  conditional  derivation  will  always  be  a  subproof.  A 
subproof  is  a  proof  within  another  proof.  We  always  start  with  a  direct  proof,  and  then  do 
the  conditional  proof  within  that  direct  proof. 

Here  is  how  we  would  apply  the  proof  method  to  prove  the  validity  of  Hobbes’s  argument, 
as  we  reconstructed  it  above. 
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1.  (P  — »  Q)  premise 

2.  (Q  — >  R))  premise 


3.  P 

4Q 
5.  R 

6.  (P  ->  R) 


assumption  for  conditional  derivation 

modus  ponens,  1,  3 
modus  ponens,  2,  4 
conditional  derivation,  3-5 


Our  Fitch  bars  make  clear  what  is  a  sub-proof  here;  they  let  us  see  this  as  a  direct  derivation 
with  a  conditional  derivation  embedded  in  it.  This  is  an  important  concept:  we  can  have 
proofs  within  proofs. 

An  important  principle  is  that  once  a  subproof  is  done,  we  cannot  use  any  of  the  lines  in  the 
subproof.  We  need  this  rule  because  conditional  derivation  allowed  us  to  make  a  special 
assumption  that  we  use  only  temporarily.  Above,  we  assumed  P.  Our  goal  is  only  to  show 
that  if  P  is  true,  then  R  is  true.  But  perhaps  P  isn’t  true.  We  do  not  want  to  later  make 
use  of  P  for  some  other  purpose.  So,  we  have  the  rule  that  when  a  subproof  is  complete, 
you  cannot  use  the  lines  that  occur  in  the  subproof.  In  this  case,  that  means  that  we  cannot 
use  lines  3,  4,  or  5  for  any  other  purpose  than  to  show  the  conditional  (P^R).  We  cannot 
now  cite  those  individual  lines  again.  We  can,  however,  use  line  6,  the  conclusion  of  the 
subproof. 

The  Fitch  bars — which  we  have  used  before  now  in  our  proofs  only  to  separate  the 
premises  from  the  later  steps — now  have  a  very  beneficial  use.  They  allow  us  to  set  aside  a 
conditional  derivation  as  a  subproof,  and  they  help  remind  us  that  we  cannot  cite  the  lines 
in  that  subproof  once  the  subproof  is  complete. 

It  might  be  helpful  to  give  an  example  of  why  this  is  necessary.  That  is,  it  might  be  helpful 
to  give  an  example  of  an  argument  made  invalid  because  it  makes  use  of  lines  in  a  finished 
subproof.  Consider  the  following  argument. 

If  you  are  Pope,  then  you  have  a  home  in  the  Vatican. 

If  you  have  a  home  in  the  Vatican,  then  you  hear  church  bells  often. 


If  you  are  Pope,  then  you  hear  church  bells  often. 

That  is  a  valid  argument,  with  the  same  form  as  the  argument  we  adopted  from  Hobbes. 
However,  if  we  broke  our  rule  about  conditional  derivations,  we  could  prove  that  you  are 
Pope.  Let’s  use  this  key: 

S:  You  are  Pope. 


T:  You  have  a  home  in  the  Vatican. 
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U:  You  hear  church  bells  often. 


Now  consider  this  “proof”: 


1.  (S  T)  premise 

2.  (T  — >  U))  premise 


3.  S 

4  T 

5.  U 

6.  (S  U) 

7.  S 


assumption  for  conditional  derivation 

modus  ponens,  1,  3 
modus  ponens,  2,  4 
conditional  derivation,  3-5 
repeat,  3 


And,  thus,  we  have  proven  that  you  are  Pope.  But,  of  course,  you  are  not  the  Pope.  From 
true  premises,  we  ended  up  with  a  false  conclusion,  so  the  argument  is  obviously  invalid. 
What  went  wrong?  The  problem  was  that  after  we  completed  the  conditional  derivation 
that  occurs  in  lines  3  through  5,  and  used  that  conditional  derivation  to  assert  line  6,  we 
can  no  longer  use  those  lines  3  through  5.  But  on  line  7  we  made  use  of  line  3.  Line  3  is 
not  something  we  know  to  be  true;  our  reasoning  from  lines  3  through  line  5  was  to  ask,  if 
S  were  true,  what  else  would  be  true?  When  we  are  done  with  that  conditional  derivation, 
we  can  use  only  the  conditional  that  we  derived,  and  not  the  steps  used  in  the  conditional 
derivation. 


6.3  Some  additional  examples 

Here  are  a  few  kinds  of  arguments  that  help  illustrate  the  power  of  the  conditional 
derivation. 

This  argument  makes  use  of  conjunctions. 

(P-Q) 

(R-S) 


((P*RHQAS)) 

We  always  begin  by  constructing  a  direct  proof,  using  the  Fitch  bar  to  identify  the  premises 
of  our  argument,  if  any. 


1.  (P  -a  Q) 

2.  (R  -a  S) 


premise 

premise 
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Because  the  conclusion  is  a  conditional,  we  assume  the  antecedent  and  show  the 
consequent. 


(P  -a  Q) 

premise 

(R  -a  S) 

premise 

3.  (PAR) 

assumption  for  conditional  derivation 

4  P 

simplification,  3 

5.  Q 

modus  ponens,  1,  4 

6.  R 

simplification,  3 

7.  S 

modus  ponens,  2,  6 

8.  (Q  AS) 

adjunction,  5,  7 

((PAR)n(QAS)) 

conditional  derivation,  3-8 

Here’s  another  example.  Note  that  the  following  argument  is  valid. 

(P-(S-R)) 

(P-(Q-S)) 


(P-(Q-R)) 


The  proof  will  require  several  embedded  subproofs. 


1.  (P  — >■  (S  — >  R))  premise 

2.  (P  — >■  (Q  — »  S))  premise 


3.  P 


assumption  for  conditional  derivation 


4.  Q 


assumption  for  conditional  derivation 


5.  (Q  -A  S) 

6.  S 

7.  (S  -A  R) 

8.  R 

9.  (Q  -A  R) 

10.  (P->(Q->R)) 


modus  ponens,  2,  3 
modus  ponens,  5,  4 
modus  ponens,  1,  3 
modus  ponens,  7,  6 
conditional  derivation,  4-8 
conditional  derivation,  3-9 


6.4  Theorems 

Conditional  derivation  allows  us  to  see  an  important  new  concept.  Consider  the  following 
sentence: 


((P->Q)  -(-Q-mP)) 
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This  sentence  is  a  tautology.  To  check  this,  we  can  make  its  truth  table. 


p 

O' 

-Q 

-.P 

(P- 

♦Q) 

(iQ—P) 

((p-Q)  -  (-’Q—p)) 

T 

T 

F 
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T 

T 

T 
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T 
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T 

T 

This  sentence  is  true  in  every  kind  of  situation,  which  is  what  we  mean  by  a  “tautology”. 

Now  reflect  on  our  definition  of  “valid”:  necessarily,  if  the  premises  are  true,  then  the 
conclusion  is  true.  What  about  an  argument  in  which  the  conclusion  is  a  tautology?  By 
our  definition  of  “valid”,  an  argument  with  a  conclusion  that  must  be  true  must  be  a  valid 
argument — no  matter  what  the  premises  are!  (If  this  confuses  you,  look  back  at  the  truth 
table  for  the  conditional.  Our  definition  of  valid  includes  the  conditional:  if  the  premises 
are  true,  then  the  conclusion  is  true.  Suppose  now  our  conclusion  must  be  true.  Any 
conditional  with  a  true  consequent  is  true.  So  the  definition  of  “valid”  must  be  true  of 
any  argument  with  a  tautology  as  a  conclusion.)  And,  given  that,  it  would  seem  that  it  is 
irrelevant  whether  we  have  any  premises  at  all,  since  any  will  do.  This  suggests  that  there 
can  be  valid  arguments  with  no  premises. 

Conditional  derivation  lets  us  actually  construct  such  arguments.  First,  we  will  draw  our 
Fitch  bar  for  our  main  argument  to  indicate  that  we  have  no  premises.  Then,  we  will 
construct  a  conditional  derivation.  It  will  start  like  this: 


l.  (P  -»  Q) 


assumption  for  conditional  derivation 


But  what  now?  Well,  we  have  assumed  the  antecedent  of  our  sentence,  and  we  should 
strive  now  to  show  the  consequent.  But  note  that  the  consequent  is  a  conditional.  So,  we 
will  again  do  a  conditional  derivation. 


assumption  for  conditional  derivation 
assumption  for  conditional  derivation 
modus  tollens,  1,  2 

|  4.  (->Q  — >  -iP)  conditional  derivation,  2-3 

5.  ((P  -*  Q)  — >  (-iQ  — *  -iP))  conditional  derivation,  1-4 


l.  (P  -►  Q) 

2.  -.Q 

3.  -P 


This  is  a  proof,  without  premises,  of  ((P-»Q)-»(-iQ-»-iP)).  The  top  of  the  proof  shows 
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that  we  have  no  premises.  Our  conclusion  is  a  conditional,  so,  on  line  1,  we  assumed  the 
antecedent  of  the  conditional.  We  now  have  to  show  the  consequent  of  the  conditional; 
but  the  consequent  of  the  conditional  is  also  a  conditional,  so  we  assumed  its  antecedent  on 
line  2.  Line  4  is  the  result  of  the  conditional  derivation  from  lines  2  to  3.  Lines  1  through 
4  tell  us  that  if  (P— Q)  is  true,  then  (iQ— -iP)  is  true.  And  that  is  what  we  conclude  on  line 
5. 

We  call  a  sentence  that  can  be  proved  without  premises  a  “theorem”.  Theorems  are  special 
because  they  reveal  the  things  that  follow  from  logic  alone.  It  is  a  very  great  benefit  of  our 
propositional  logic  that  all  the  theorems  are  tautologies.  It  is  an  equally  great  benefit  of  our 
propositional  logic  that  all  the  tautologies  are  theorems.  Nonetheless,  these  concepts  are 
different.  “Tautology”  refers  to  a  semantic  concept:  a  tautology  is  a  sentence  that  must  be 
true.  “Theorem”  refers  to  a  concept  of  syntax  and  derivation:  a  theorem  is  a  sentence  that 
can  be  derived  without  premises. 

Theorem:  a  sentence  that  can  be  proved  without  premises. 

Tautology:  a  sentence  of  the  propositional  logic  that  must  be  true. 

6.5  Problems 

1.  Prove  the  following  arguments  are  valid.  This  will  require  conditional  derivation. 

a.  Premise:  (P->Q),  (S-R).  Conclusion:  ((-.Q  A  ->R)  -  (-.P  A  -.S). 

b.  Premise:  (P-Q).  Conclusion:  ((P  A  R)  -Q). 

c.  Premise:  ((RAQ)  -  S),  (-.P-(RAQ)).  Conclusion:  (-.S-P). 

d.  Premise:  (P^Q).  Conclusion:  (Q— p)- 

e.  Premises:  (P-Q),  (P-R).  Conclusion:  (P-(QAR))). 

f.  Premises:  (P-(Q-R)),  Q.  Conclusion:  (P-R). 

2.  Prove  the  following  theorems. 

a.  (P-P). 

b.  ((P-Q)->((R->P)-(R->Q))). 

c.  ((P-(Q-R))-((P-Q)-(P-R)). 

d.  ((M>-Q)  -  (^Q-P)). 

e.  (((P-Q)  A  (P-R))  -  (P-(QAR))). 

3.  Make  a  truth  table  for  each  of  the  following  complex  sentences,  in  order  to  see  when  it 
is  true  or  false.  Identify  which  are  tautologies.  Prove  the  tautologies. 

a.  ((P-Q)-Q). 

b.  (P-(P-Q)). 

c.  (P-(Q-P)). 

d.  (P— .P). 

e.  (P— P). 
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4.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 

premises  and  with  a  conclusion  that  is  a  conditional.  Your  argument  should  just  be  a 
paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like  formal 
logic).  Translate  it  into  propositional  logic  and  prove  it  is  valid. 


[8]  Hobbes  (1886:  64). 


7.  "Or" 


7.1  A  historical  example:  The  Euthryphro  argument 

The  philosopher  Plato  (who  lived  from  approximately  427  BC  to  347  BC)  wrote  a  series  of 
great  philosophical  texts.  Plato  was  the  first  philosopher  to  deploy  argument  in  a  vigorous 
and  consistent  way,  and  in  so  doing  he  showed  how  philosophy  takes  logic  as  its  essential 
method.  We  think  of  Plato  as  the  principal  founder  of  Western  philosophy.  The  American 
philosopher  Alfred  Whitehead  (1861-1947)  in  fact  once  famously  quipped  that  philosophy 
is  a  “series  of  footnotes  to  Plato”. 

Plato’s  teacher  was  Socrates  (c.  469-399  B.C.),  a  gadfly  of  ancient  Athens  who  made  many 
enemies  by  showing  people  how  little  they  knew.  Socrates  did  not  write  anything,  but 
most  of  Plato’s  writings  are  dialogues,  which  are  like  small  plays,  in  which  Socrates  is  the 
protagonist  of  the  philosophical  drama  that  ensues.  Several  of  the  dialogues  are  named  after 
the  person  who  will  be  seen  arguing  with  Socrates.  In  the  dialogue  Euthyphro,  Socrates  is 
standing  in  line,  awaiting  his  trial.  He  has  been  accused  of  corrupting  the  youth  of  Athens. 
A  trial  in  ancient  Athens  was  essentially  a  debate  before  the  assembled  citizen  men  of  the 
city.  Before  Socrates  in  line  is  a  young  man,  Euthyphro.  Socrates  asks  Euthyphro  what  his 
business  is  that  day,  and  Euthyphro  proudly  proclaims  he  is  there  to  charge  his  own  father 
with  murder.  Socrates  is  shocked.  In  ancient  Athens,  respect  for  one’s  father  was  highly 
valued  and  expected.  Socrates,  with  characteristic  sarcasm,  tells  Euthyphro  that  he  must  be 
very  wise  to  be  so  confident.  Here  are  two  profound  and  conflicting  duties:  to  respect 
one’s  father,  and  to  punish  murder.  Euthyphro  seems  to  find  it  very  easy  to  decide  which 
is  the  greater  duty.  Euthyphro  is  not  bothered.  To  him,  these  ethical  matters  are  simple: 
one  should  be  pious.  When  Socrates  demands  a  definition  of  piety  that  applies  to  all  pious 
acts,  Euthyphro  says, 

Piety  is  that  which  is  loved  by  the  gods  and  impiety  is  that  which  is  not  loved  by 
them. 

Socrates  observes  that  this  is  ambiguous.  It  could  mean,  an  act  is  good  because  the  gods 
love  that  act.  Or  it  could  mean,  the  gods  love  an  act  because  it  is  good.  We  have,  then,  an 
“or”  statement,  which  logicians  call  a  “disjunction”: 

Either  an  act  is  good  because  the  gods  love  that  act,  or  the  gods  love  an  act  because 
it  is  good. 

Might  the  former  be  true?  This  view — that  an  act  is  good  because  the  gods  love  it — is 
now  called  “divine  command  theory”,  and  theists  have  disagreed  since  Socrates’s  time  about 
whether  it  is  true.  But,  Socrates  finds  it  absurd.  For,  if  tomorrow  the  gods  love,  say, 
murder,  then,  tomorrow  murder  would  be  good. 

Euthyphro  comes  to  agree  that  it  cannot  be  that  an  act  is  good  because  the  gods  love  that 
act.  Our  argument  so  far  has  this  form: 
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Either  an  act  is  good  because  the  gods  love  that  act,  or  the  gods  love  an  act  because 
it  is  good. 

It  is  not  the  case  that  an  act  is  good  because  the  gods  love  it. 

Socrates  concludes  that  the  gods  love  an  act  because  it  is  good. 

Either  an  act  is  good  because  the  gods  love  that  act,  or  the  gods  love  an  act  because 
it  is  good. 

It  is  not  the  case  that  an  act  is  good  because  the  gods  love  it. 


The  gods  love  an  act  because  it  is  good. 

This  argument  is  one  of  the  most  important  arguments  in  philosophy.  Most  philosophers 
consider  some  version  of  this  argument  both  valid  and  sound.  Some  who  disagree  with  it 
bite  the  bullet  and  claim  that  if  tomorrow  God  (most  theistic  philosophers  alive  today  are 
monotheists)  loved  puppy  torture,  adultery,  random  acts  of  cruelty,  pollution,  and  lying, 
these  would  all  be  good  things.  (If  you  are  inclined  to  say,  “That  is  not  fair,  God  would 
never  love  those  things”,  then  you  have  already  agreed  with  Socrates.  For,  the  reason  you 
believe  that  God  would  never  love  these  kinds  of  acts  is  because  these  kinds  of  acts  are 
bad.  But  then,  being  bad  or  good  is  something  independent  of  the  love  of  God.)  But 
most  philosophers  agree  with  Socrates:  they  find  it  absurd  to  believe  that  random  acts  of 
cruelty  and  other  such  acts  could  be  good.  There  is  something  inherently  bad  to  these 
acts,  they  believe.  The  importance  of  the  Euthyphro  argument  is  not  that  it  helps  illustrate 
that  divine  command  theory  is  an  enormously  strange  and  costly  position  to  hold  (though 
that  is  an  important  outcome),  but  rather  that  the  argument  shows  ethics  can  be  studied 
independently  of  theology.  For,  if  there  is  something  about  acts  that  makes  them  good  or 
bad  independently  of  a  god’s  will,  then  we  do  not  have  to  study  a  god’s  will  to  study  what 
makes  those  acts  good  or  bad. 

Of  course,  many  philosophers  are  atheists  so  they  already  believed  this,  but  for  most  of 
philosophy’s  history,  one  was  obliged  to  be  a  theist.  Even  today,  lay  people  tend  to  think 
of  ethics  as  an  extension  of  religion.  Philosophers  believe  instead  that  ethics  is  its  own  field 
of  study.  The  Euthyphro  argument  explains  why,  even  if  you  are  a  theist,  you  can  study 
ethics  independently  of  studying  theology. 

But  is  Socrates’s  argument  valid?  Is  it  sound? 

7.2  The  disjunction 

We  want  to  extend  our  language  so  that  it  can  represent  sentences  that  contain  an  “or”. 
Sentences  like 

Tom  will  go  to  Berlin  or  Paris. 

We  have  coflee  or  tea. 
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This  web  page  contains  the  phrase  “Mark  Twain”  or  “Samuel  Clemens.” 

Logicians  call  these  kinds  of  sentences  “disjunctions”.  Each  of  the  two  parts  of  a  disjunction 
is  called  a  “disjunct”.  The  idea  is  that  these  are  really  equivalent  to  the  following  sentences: 

Tom  will  go  to  Berlin  or  Tom  will  go  to  Paris. 

We  have  coffee  or  we  have  tea. 

This  web  page  contains  the  phrase  “Mark  Twain”  or  this  web  page  contains  the 
phrase  “Samuel  Clemens.” 

We  can,  therefore,  see  that  (at  least  in  many  sentences)  the  “or”  operates  as  a  connective 
between  two  sentences. 

It  is  traditional  to  use  the  symbol  “v”  for  “or”.  This  comes  from  the  Latin  “vel,”  meaning 
(in  some  contexts)  or. 

The  syntax  for  the  disjunction  is  very  basic.  If  $  and  VP  are  sentences,  then 
($  v^P) 
is  a  sentence. 

The  semantics  is  a  little  more  controversial.  This  much  of  the  defining  truth  table,  most 
people  find  obvious: 
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Consider:  if  I  promise  that  I  will  bring  you  roses  or  lilacs,  then  it  seems  that  I  told  the  truth 
either  if  I  have  brought  you  roses  but  not  lilacs,  or  if  I  brought  you  lilacs  but  not  roses. 
Similarly,  the  last  row  should  be  intuitive,  also.  If  I  promise  I  will  bring  you  roses  or  lilacs, 
and  I  bring  you  nothing,  then  I  spoke  falsely. 

What  about  the  first  row?  Many  people  who  are  not  logicians  want  it  to  be  the  case  that 
we  define  this  condition  as  false.  The  resulting  meaning  would  correspond  to  what  is 
sometimes  called  the  “exclusive  ‘or’”.  Logicians  disagree.  They  favor  the  definition  where 
a  disjunction  is  true  if  its  two  parts  are  true;  this  is  sometimes  called  the  “inclusive  ‘or’”.  Of 
course,  all  that  matters  is  that  we  pick  a  definition  and  stick  with  it,  but  we  can  older  some 
reasons  why  the  “inclusive  ‘or’”,  as  we  call  it,  is  more  general  than  the  “exclusive  ‘or’”. 

Consider  the  first  two  sentences  above.  It  seems  that  the  first  sentence — “Tom  will  go  to 
Berlin  or  Paris” — should  be  true  if  Tom  goes  to  both.  Or  consider  the  second  sentence, 
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“We  have  coflee  or  tea.”  In  most  restaurants,  this  means  they  have  both  coffee  and  they 
have  tea,  but  they  expect  that  you  will  order  only  one  of  these.  After  all,  it  would  be  strange 
to  be  told  that  they  have  coffee  or  tea,  and  then  be  told  that  it  is  false  that  they  have  both 
coffee  and  tea.  Or,  similarly,  suppose  the  waiter  said,  “We  have  coffee  or  tea”,  and  then 
you  said  “I’ll  have  both”,  and  the  waiter  replied  “We  don’t  have  both”.  This  would  seem 
strange.  But  if  you  find  it  strange,  then  you  implicitly  agree  that  the  disjunction  should  be 
interpreted  as  the  inclusive  “or”. 

Examples  like  these  suggest  to  logicians  that  the  inclusive  “or”  (where  the  first  row  of  the 
table  is  true)  is  the  default  case,  and  that  the  context  of  our  speech  tells  us  when  not  both 
disjuncts  are  true.  For  example,  when  a  restaurant  has  a  fixed  price  menu — where  you  pay 
one  fee  and  then  get  either  steak  or  lobster — it  is  understood  by  the  context  that  this  means 
you  can  have  one  or  the  other  but  not  both.  But  that  is  not  logic,  that  is  social  custom. 
One  must  know  about  restaurants  to  determine  this. 

Thus,  it  is  customary  to  define  the  semantics  of  the  disjunction  as 
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We  haven’t  lost  the  ability  to  express  the  exclusive  “or”.  We  can  say,  “one  or  the  other  but 
not  both”,  which  is  expressed  by  the  formula  v  VF)  A  -i($>  A  W))”.  To  check,  we  can 
make  the  truth  table  for  this  complex  expression: 
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Note  that  this  formula  is  equivalent  to  the  exclusive  “or”  (it  is  true  when  $>  is  true  or  VF  is 
true,  but  not  when  both  are  true  or  both  are  false).  So,  if  we  need  to  say  something  like  the 
exclusive  “or”,  we  can  do  so. 


7.3  Alternative  forms 

There  do  not  seem  to  be  many  alternative  expressions  in  English  equivalent  to  the  “or”. 
We  have 


P  or  Q 
Either  P  or  Q 
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These  are  both  expressed  in  our  logic  with  (P  v  Q). 

One  expression  that  does  arise  in  English  is  “neither... nor...”.  This  expression  seems  best 
captured  by  simply  making  it  into  “not  either...  or...”.  Let’s  test  this  proposal.  Consider 
the  sentence 

Neither  Smith  nor  Jones  will  go  to  London. 

This  sentence  expresses  the  idea  that  Smith  will  not  go  to  London,  and  that  Jones  will  not 
go  to  London.  So,  it  would  surely  be  a  mistake  to  express  it  as 

Either  Smith  will  not  go  to  London  or  Jones  will  not  go  to  London. 

Why?  Because  this  latter  sentence  would  be  true  if  one  of  them  went  to  London  and  one 
of  them  did  not.  Consider  the  truth  table  for  this  expression  to  see  this.  Use  the  following 
translation  key. 

P:  Smith  will  go  to  London. 

Q:  Jones  will  go  to  London. 

Then  suppose  we  did  (wrongly)  translate  “Neither  Smith  nor  Jones  will  go  to  London” 
with 


(iP  v  iQ) 


Here  is  the  truth  table  for  this  expression. 
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Note  that  this  sentence  is  true  if  P  is  true  and  Q  is  false,  or  if  Q  is  true  and  P  is  false.  In 
other  words,  it  is  true  if  one  of  the  two  goes  to  London.  That’s  not  what  we  mean  in 
English  by  that  sentence  claiming  that  neither  of  them  will  go  to  London. 

The  better  translation  is  -(PvQ). 
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This  captures  the  idea  well:  it  is  only  true  if  each  does  not  go  to  London.  So,  we  can  simply 
translate  “neither. . .nor. . .”  as  “It  is  not  the  case  that  either. . .  or. . 

7.4  Reasoning  with  disjunctions 

How  shall  we  reason  with  the  disjunction?  Looking  at  the  truth  table  that  defines  the 
disjunction,  we  find  that  we  do  not  know  much  if  we  are  told  that,  say,  (P  v  Q).  P  could 
be  true,  or  it  could  be  false.  The  same  is  so  for  Q.  All  we  know  is  that  they  cannot  both  be 
false. 

This  does  suggest  a  reasonable  and  useful  kind  of  inference  rule.  If  we  have  a  disjunction, 
and  we  discover  that  half  of  it  is  false,  then  we  know  that  the  other  half  must  be  true.  This 
is  true  for  either  disjunct.  This  means  we  have  two  rules,  but  we  can  group  together  both 
rules  with  a  single  name  and  treat  them  as  one  rule: 

(9  vW) 


W 


and 


(<£  v  W) 

-AP 


$ 

This  rule  is  traditionally  called  “modus  tollendo  ponens”. 

What  if  we  are  required  to  show  a  disjunction?  One  insight  we  can  use  is  that  if  some 
sentence  is  true,  then  any  disjunction  that  contains  it  is  true.  This  is  so  whether  the 
sentence  makes  up  the  first  or  second  disjunct.  Again,  then,  we  would  have  two  rules, 
which  we  can  group  together  under  one  name: 

$ 


($vf) 


and 
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W 


($vf) 

This  rule  is  often  called  “addition”. 

The  addition  rule  often  confuses  students.  It  seems  to  be  a  cheat,  as  if  we  are  getting  away 
with  something  for  free.  But  a  moment  of  reflection  will  help  clarify  that  just  the  opposite 
is  true.  We  lose  information  when  we  use  the  addition  rule.  If  you  ask  me  where  John  is, 
and  I  say,  “John  is  in  New  York”,  I  told  you  more  than  if  I  answered  you,  “John  is  either 
in  New  York  or  in  New  Jersey”.  Just  so,  when  we  go  from  some  sentence  P  to  (PvQ),  we 
did  not  get  something  for  free. 

This  rule  does  have  the  seemingly  odd  consequence  that  from,  say,  2+2=4  you  can  derive 
that  either  2+2=4  or  7=0.  But  that  only  seems  odd  because  in  normal  speech,  we  have  a 
number  of  implicit  rules.  The  philosopher  Paul  Grice  (1913-1988)  described  some  of  these 
rules,  and  we  sometimes  call  the  rules  he  described  “Grice’s  Maxims”.[9]  He  observed  that 
in  conversation  we  expect  people  to  give  all  the  information  required  but  not  more;  to  try 
to  be  truthful;  to  say  things  that  are  relevant;  and  to  be  clear  and  brief  and  orderly.  So, 
in  normal  English  conversations,  if  someone  says,  “Tom  is  in  New  York  or  New  Jersey,” 
they  would  be  breaking  the  rule  to  give  enough  information,  and  to  say  what  is  relevant, 
if  they  knew  that  Tom  was  in  New  York.  This  also  means  that  we  expect  people  to 
use  a  disjunction  when  they  have  reason  to  believe  that  either  or  both  disjuncts  could  be 
true.  But  our  logical  language  is  designed  only  to  be  precise,  and  we  have  been  making 
the  language  precise  by  specifying  when  a  sentence  is  true  or  false,  and  by  specifying  the 
relations  between  sentences  in  terms  of  their  truth  values.  We  are  thus  not  representing, 
and  not  putting  into  our  language,  Grice’s  maxims  of  conversation.  It  remains  true  that  if 
you  knew  Tom  is  in  New  York,  but  answered  my  question  “Where  is  Tom?”  by  saying 
“Tom  is  in  New  York  or  New  Jersey”,  then  you  have  wasted  my  time.  But  you  did  not  say 
something  false. 

We  are  now  in  a  position  to  test  Socrates’s  argument.  Using  the  following  translation  key, 
we  can  translate  the  argument  into  symbolic  form. 

P:  An  act  is  good  because  the  gods  love  that  act. 

Q:  The  gods  love  an  act  because  it  is  good. 

Euthyphro  had  argued 

I  1.  (P  V  Q)  premise 


Socrates  had  got  Euthryphro  to  admit  that 
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1.  (PVQ) 

premise 

2. 

premise 

And  so  we  have  a  simple  direct  derivation: 

1.  (PVQ) 

premise 

2.  ^P 

premise 

3.  Q 

modus  tollendo  ponens,  1,  2 

Socrates’s  argument  is  valid.  I  will  leave 
argument  is  sound. 

it  up  to  you  to  determine  whether  Socrates’s 

Another  example  might  be  helpful.  Here  is  an  argument  in  our  logical  language. 

(PvQ) 

-.P 

(^P-(Q-R)) 

(R  vS) 

This  will  make  use  of  the  addition  rule,  and 
Here  is  one  possible  proof. 

so  is  useful  to  illustrating  that  rule’s  application. 

1.  (PVQ) 

premise 

2.  ^P 

premise 

3.  (->P  — >■  (Q  ->■  R)) 

premise 

4.  Q 

modus  tollendo  ponens,  1,  2 

5.  (Q  -A  R) 

modus  ponens,  3,  2 

6.  R 

modus  ponens,  5,  4 

7.  (R  V  S) 

addition,  6 

7.5  Alternative  symbolizations  of  disjunction 

We  are  fortunate  that  there  have  been  no  popular  alternatives  to  the  use  of  “v”  as  a  symbol 
for  disjunction.  Perhaps  the  second  most  widely  used  alternative  symbol  was  “II”,  such  that 
(P  v  Q)  would  be  symbolized: 


(P  II  Q) 
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7.6  Problems 

1.  Translate  the  following  passage  into  our  propositional  logic.  Prove  the  argument  is 
valid. 


Either  Dr.  Kronecker  or  Bishop  Berkeley  killed  Colonel  Cardinality.  If  Dr. 
Kronecker  killed  Colonel  Cardinality,  then  Dr.  Kronecker  was  in  the  kitchen.  If 
Bishop  Berkeley  killed  Colonel  Cardinality,  then  he  was  in  the  drawing  room. 
If  Bishop  Berkeley  was  in  the  drawing  room,  then  he  was  wearing  boots.  But 
Bishop  Berkeley  was  not  wearing  boots.  So,  Dr.  Kronecker  killed  the  Colonel. 

2.  Translate  the  following  passage  into  our  propositional  logic.  Prove  the  argument  is 
valid. 


Either  Wittgenstein  or  Meinong  stole  the  diamonds.  If  Meinong  stole  the 
diamonds,  then  he  was  in  the  billiards  room.  But  if  Meinong  was  in  the  library, 
then  he  was  not  in  the  billiards  room.  Therefore,  if  Meinong  was  in  the  library, 
Wittgenstein  stole  the  diamonds. 

3.  Prove  the  following  using  a  derivation. 

a.  Premises:  (PvQ),  (Q-S),  (iSAT).  Conclusion:  (TAP). 

b.  Premises:  ((P^Q)A(R-S)),  (QvR).  Conclusion:  (P-S). 

c.  Premises:  (RvS),  ((S-T)AV),  iT,  ((RAV)-P).  Conclusion:  (PvQ). 

d.  Premises:  ((PAQ)vR),((PAQ)-S),  iS.  Conclusion:  R. 

e.  Conclusion:  ((PvQbQP-Q)). 

4.  Consider  the  following  four  cards  in  figure  7.1.  Each  card  has  a  letter  on  one  side,  and 
a  shape  on  the  other  side. 


Card  1 

Card  2 

Card  3 

Card  4 

★ 

A 

Q 

♦ 

Figure  1.1 

For  each  of  the  following  claims,  determine  (l)  the  minimum  number  of  cards  you  must 
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turn  over  to  check  the  claim,  and  (2)  what  those  cards  are,  in  order  to  determine  if  the  claim 

is  true  of  all  four  cards. 

a.  If  there  is  a  P  or  Q  on  the  letter  side  of  the  card,  then  there  is  a  diamond  on  the  shape 
side  of  the  card. 

b.  If  there  is  a  Q  on  the  letter  side  of  the  card,  then  there  is  either  a  diamond  or  a  star  on 
the  shape  side  of  the  card. 

5.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises,  at  least  one  of  which  is  a  disjunction.  Your  argument  should  just  be  a 
paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like  formal 
logic).  Translate  it  into  propositional  logic  and  prove  it  is  valid. 

[9]  Grice  (1975). 


8.  Reductio  ad  Absurdum 


8.1  A  historical  example 

In  his  book,  The  Two  New  Sciences}10*  Galileo  Galilea  (1564-1642)  gives  several  arguments 
meant  to  demonstrate  that  there  can  be  no  such  thing  as  actual  infinities  or  actual 
infinitesimals.  One  of  his  arguments  can  be  reconstructed  in  the  following  way.  Galileo 
proposes  that  we  take  as  a  premise  that  there  is  an  actual  infinity  of  natural  numbers  (the 
natural  numbers  are  the  positive  whole  numbers  from  1  on): 

(1,  2,  3,  4,  5,  6,  7,  ....} 

He  also  proposes  that  we  take  as  a  premise  that  there  is  an  actual  infinity  of  the  squares  of 
the  natural  numbers. 

(1,  4,  9,  16,  25,  36,  49,  ....} 

Now,  Galileo  reasons,  note  that  these  two  groups  (today  we  would  call  them  “sets”) 
have  the  same  size.  We  can  see  this  because  we  can  see  that  there  is  a  one-to-one 
correspondence  between  the  two  groups. 


If  we  can  associate  every  natural  number  with  one  and  only  one  square  number,  and  if  we 
can  associate  every  square  number  with  one  and  only  one  natural  number,  then  these  sets 
must  be  the  same  size. 

But  wait  a  moment,  Galileo  says.  There  are  obviously  very  many  more  natural  numbers 
than  there  are  square  numbers.  That  is,  every  square  number  is  in  the  list  of  natural 
numbers,  but  many  of  the  natural  numbers  are  not  in  the  list  of  square  numbers.  The 
following  numbers  are  all  in  the  list  of  natural  numbers  but  not  in  the  list  of  square  numbers. 

[2,  3,  5,  6,  7,  8,  10,  ....} 

So,  Galileo  reasons,  if  there  are  many  numbers  in  the  group  of  natural  numbers  that  are  not 
in  the  group  of  the  square  numbers,  and  if  there  are  no  numbers  in  the  group  of  the  square 
numbers  that  are  not  in  the  naturals  numbers,  then  the  natural  numbers  is  bigger  than  the 
square  numbers.  And  if  the  group  of  the  natural  numbers  is  bigger  than  the  group  of  the 
square  numbers,  then  the  natural  numbers  and  the  square  numbers  are  not  the  same  size. 
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We  have  reached  two  conclusions:  the  set  of  the  natural  numbers  and  the  set  of  the  square 
numbers  are  the  same  size;  and,  the  set  of  the  natural  numbers  and  the  set  of  the  square 
numbers  are  not  the  same  size.  That’s  contradictory. 

Galileo  argues  that  the  reason  we  reached  a  contradiction  is  because  we  assumed  that  there 
are  actual  infinities.  He  concludes,  therefore,  that  there  are  no  actual  infinities. 

8.2  Indirect  proofs 

Our  logic  is  not  yet  strong  enough  to  prove  some  valid  arguments.  Consider  the  following 
argument  as  an  example. 

(P-(QvR)) 

-Q 

->R 


-.P 

This  argument  looks  valid.  By  the  first  premise  we  know:  if  P  were  true,  then  so  would 
(Q  v  R)  be  true.  But  then  either  Q  or  R  or  both  would  be  true.  And  by  the  second  and 
third  premises  we  know:  Q  is  false  and  R  is  false.  So  it  cannot  be  that  (Q  v  R)  is  true,  and 
so  it  cannot  be  that  P  is  true. 


We  can  check  the  argument  using  a  truth  table.  Our  table  will  be  complex  because  one  of 
our  premise  is  complex. 


premise 

premise 

premise 

conclusion 

p 

Q 

R 

(QvR) 

(P-(QvR)) 

-Q 

-.R 

-.P 

T 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

T 

T 

F 

T 

F 

T 

F 

T 

T 

T 

T 

F 

F 

T 

F 

F 

F 

F 

T 

T 

F 

F 

T 

T 

T 

T 

F 

F 

T 

F 

T 

F 

T 

T 

F 

T 

T 

F 

F 

T 

T 

T 

T 

F 

T 

F 

F 

F 

F 

T 

T 

T 

T 

In  any  kind  of  situation  in  which  all  the  premises  are  true,  the  conclusion  is  true.  That  is: 
the  premises  are  all  true  only  in  the  last  row.  For  that  row,  the  conclusion  is  also  true.  So, 
this  is  a  valid  argument. 
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But  take  a  minute  and  try  to  prove  this  argument.  We  begin  with 


1.  (P->  (Qv  R)) 

2. 

3.  - 'R 


premise 

premise 

premise 


And  now  we  are  stopped.  We  cannot  apply  any  of  our  rules.  Here  is  a  valid  argument  that 
we  have  not  made  our  reasoning  system  strong  enough  to  prove. 

There  are  several  ways  to  rectify  this  problem  and  to  make  our  reasoning  system  strong 
enough.  One  of  the  oldest  solutions  is  to  introduce  a  new  proof  method,  traditionally  called 
“reductio  ad  absurdum”,  which  means  a  reduction  to  absurdity.  This  method  is  also  often 
called  an  “indirect  proof”  or  “indirect  derivation”. 

The  idea  is  that  we  assume  the  denial  of  our  conclusion,  and  then  show  that  a  contradiction 
results.  A  contradiction  is  shown  when  we  prove  some  sentence  W,  and  its  negation  -AF. 
This  can  be  any  sentence.  The  point  is  that,  given  the  principle  of  bivalence,  we  must  have 
proven  something  false.  For  if  W  is  true,  then  -iW  is  false;  and  if  -AF  is  true,  then  VF  is  false. 
We  don’t  need  to  know  which  is  false  (VF  or  -AF);  it  is  enough  to  know  that  one  of  them 
must  be. 

Remember  that  we  have  built  our  logical  system  so  that  it  cannot  produce  a  falsehood  from 
true  statements.  The  source  of  the  falsehood  that  we  produce  in  the  indirect  derivation 
must,  therefore,  be  some  falsehood  that  we  added  to  our  argument.  And  what  we  added  to 
our  argument  is  the  denial  of  the  conclusion.  Thus,  the  conclusion  must  be  true. 

The  shape  of  the  argument  is  like  this: 


> 


-ncf) 


-1-0 

0 


Traditionally,  the  assumption  for  indirect  derivation  has  also  been  commonly  called  “the 
assumption  for  reductio”. 

As  a  concrete  example,  we  can  prove  our  perplexing  case. 


92  A  Concise  Introduction  to  Logic 


1.  (P->  (QVR)) 

2.  — 'Q 

3.  - 'R 


premise 

premise 

premise 


4.  -.-.P 

5.  P 

6.  (Q  V  R) 

7.  R 

8.  ~ 'R 


assumption  for  indirect  derivation 

double  negation,  4 
modus  ponens,  1,  5 
modus  tollendo  ponens,  6,  2 
repeat,  3 


9.  — iP 


indirect  derivation,  4-8 


We  assumed  the  denial  of  our  conclusion  on  line  4.  The  conclusion  we  believed  was 
correct  was  -iP,  and  the  denial  of  this  is  mP.  In  line  7,  we  proved  R.  Technically,  we  are 
done  at  that  point,  but  we  would  like  to  be  kind  to  anyone  trying  to  understand  our  proof, 
so  we  repeat  line  3  so  that  the  sentences  R  and  -iR  are  side  by  side,  and  it  is  very  easy  to  see 
that  something  has  gone  wrong.  That  is,  if  we  have  proven  both  R  and  ->R,  then  we  have 
proven  something  false. 

Our  reasoning  now  goes  like  this.  What  went  wrong?  Line  8  is  a  correct  use  of  repetition; 
line  7  comes  from  a  correct  use  of  modus  tollendo  ponens;  line  6  from  a  correct  use  of 
modus  ponens;  line  5  from  a  correct  use  of  double  negation.  So,  we  did  not  make  a  mistake 
in  our  reasoning.  We  used  lines  1,  2,  and  3,  but  those  are  premises  that  we  agreed  to  assume 
are  correct.  This  leaves  line  4.  That  must  be  the  source  of  my  contradiction.  It  must  be 
false.  If  line  4  is  false,  then  -iP  is  true. 


Some  people  consider  indirect  proofs  less  strong  than  direct  proofs.  There  are  many,  and 
complex,  reasons  for  this.  But,  for  our  propositional  logic,  none  of  these  reasons  apply. 
This  is  because  it  is  possible  to  prove  that  our  propositional  logic  is  consistent.  This  means, 
it  is  possible  to  prove  that  our  propositional  logic  cannot  prove  a  falsehood  unless  one  first 
introduces  a  falsehood  into  the  system.  (It  is  generally  not  possible  to  prove  that  more 
powerful  and  advanced  logical  or  mathematical  systems  are  consistent,  from  inside  those 
systems;  for  example,  one  cannot  prove  in  arithmetic  that  arithmetic  is  consistent.)  Given 
that  we  can  be  certain  of  the  consistency  of  the  propositional  logic,  we  can  be  certain  that 
in  our  propositional  logic  an  indirect  proof  is  a  good  form  of  reasoning.  We  know  that  if 
we  prove  a  falsehood,  we  must  have  put  a  falsehood  in;  and  if  we  are  confident  about  all  the 
other  assumptions  (that  is,  the  premises)  of  our  proof  except  for  the  assumption  for  indirect 
derivation,  then  we  can  be  confident  that  this  assumption  for  indirect  derivation  must  be 
the  source  of  the  falsehood. 


A  note  about  terminology  is  required  here.  The  word  “contradiction”  gets  used 
ambiguously  in  most  logic  discussions.  It  can  mean  a  situation  like  we  see  above,  where 
two  sentences  are  asserted,  and  these  sentences  cannot  both  be  true.  Or  it  can  mean  a  single 
sentence  that  cannot  be  true.  An  example  of  such  a  sentence  is  (PA-iP).  The  truth  table  for 
this  sentence  is: 
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p 

-.P 

(P  A  -.P) 

T 

F 

F 

F 

T 

F 

Thus,  this  kind  of  sentence  can  never  be  true,  regardless  of  the  meaning  of  P. 

To  avoid  ambiguity,  in  this  text,  we  will  always  call  a  single  sentence  that  cannot  be  true  a 
“contradictory  sentence”.  Thus,  (PA-iP)  is  a  contradictory  sentence.  Situations  where  two 
sentences  are  asserted  that  cannot  both  be  true  will  be  called  a  “contradiction”. 

8.3  Our  example,  and  other  examples 

We  can  reconstruct  a  version  of  Galileo’s  argument  now.  We  will  use  the  following  key. 

P:  There  are  actual  infinities  (including  the  natural  numbers  and  the  square 
numbers). 

Q:  There  is  a  one-to-one  correspondence  between  the  natural  numbers  and  the 
square  numbers. 

R:  The  size  of  the  set  of  the  natural  numbers  and  the  size  of  the  set  of  the  square 
numbers  are  the  same. 

S:  All  the  square  numbers  are  natural  numbers. 

T:  Some  of  the  natural  numbers  are  not  square  numbers. 

U:  There  are  more  natural  numbers  than  square  numbers. 

With  this  key,  the  argument  will  be  translated: 

(P-Q) 

(Q-R) 

(P-(SAT)) 

((SAT)->U) 

(U—R) 


-.P 


And  we  can  prove  this  is  a  valid  argument  by  using  indirect  derivation: 


94  A  Concise  Introduction  to  Logic 


1.  (P  — >■  Q)  premise 

2.  (Q  — y  R)  premise 

3.  (P  —¥  (S  A  T))  premise 

4.  ((SAT)  — »■  U )  premise 

5.  (U  -A  -iR)  premise 


6.  -.-.P 

7.  P 

8.  Q 

9.  R 

10.  (SAT) 

11.  U 

12.  -iR 

13.  R 
14. 


assumption  for  indirect  derivation 

double  negation,  6 
modus  ponens,  1,  7 
modus  ponens,  2,  8 
modus  ponens,  3,  7 
modus  ponens,  4,  10 
modus  ponens,  5,  11 
repeat,  9 

indirect  derivation,  6-13 


On  line  6,  we  assumed  mP  because  Galileo  believed  that  -iP  and  aimed  to  prove  that  ->P. 
That  is,  he  believed  that  there  are  no  actual  infinities,  and  so  assumed  that  it  was  false  to 
believe  that  it  is  not  the  case  that  there  are  no  actual  infinities.  This  falsehood  will  lead  to 
other  falsehoods,  exposing  itself. 

For  those  who  are  interested:  Galileo  concluded  that  there  are  no  actual  infinities  but  there 
are  potential  infinities.  Thus,  he  reasoned,  it  is  not  the  case  that  all  the  natural  numbers  exist 
(in  some  sense  of  “exist”),  but  it  is  true  that  you  could  count  natural  numbers  forever.  Many 
philosophers  before  and  after  Galileo  held  this  view;  it  is  similar  to  a  view  held  by  Aristotle, 
who  was  an  important  logician  and  philosopher  writing  nearly  two  thousand  years  before 
Galileo. 


Note  that  in  an  argument  like  this,  you  could  reason  that  not  the  assumption  for  indirect 
derivation,  but  rather  one  of  the  premises  was  the  source  of  the  contradiction.  Today, 
most  mathematicians  believe  this  about  Galileo’s  argument.  A  logician  and  mathematician 
named  Georg  Cantor  (1845-1918),  the  inventor  of  set  theory,  argued  that  infinite  sets  can 
have  proper  subsets  of  the  same  size.  That  is,  Cantor  denied  premise  4  above:  even  though 
all  the  square  numbers  are  natural  numbers,  and  not  all  natural  numbers  are  square  numbers, 
it  is  not  the  case  that  these  two  sets  are  of  different  size.  Cantor  accepted  however  premise  2 
above,  and,  therefore,  believed  that  the  size  of  the  set  of  natural  numbers  and  the  size  of  the 
set  of  square  numbers  is  the  same.  Today,  using  Cantor’s  reasoning,  mathematicians  and 
logicians  study  infinity,  and  have  developed  a  large  body  of  knowledge  about  the  nature  of 
infinity.  If  this  interests  you,  see  section  17.5. 

Let  us  consider  another  example  to  illustrate  indirect  derivation.  A  very  useful  set  of 
theorems  are  today  called  “De  Morgan’s  Theorems”,  after  the  logician  Augustus  De 
Morgan  (1806-1871).  We  cannot  state  these  fully  until  chapter  9,  but  we  can  state  their 
equivalent  in  English:  DeMorgan  observed  that  -i(PvQ)  and  (iPAiQ)  are  equivalent,  and 
also  that  -i(PAQ)  and  (iPv-iQ)  are  equivalent.  Given  this,  it  should  be  a  theorem  of  our 
language  that  (-i(PvQ)->(-iPa-iQ)).  Let’s  prove  this. 
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The  whole  formula  is  a  conditional,  so  we  will  use  a  conditional  derivation.  Our  proof 
must  thus  begin: 


1.  ->(P  V  Q)  assumption  for  conditional  derivation 


To  complete  the  conditional  derivation,  we  must  prove  (iPAiQ).  This  is  a  conjunction, 
and  our  rule  for  showing  conjunctions  is  adjunction.  Since  using  this  rule  might  be 
our  best  way  to  show  (iPAiQ),  we  can  aim  to  show  ->P  and  then  show  iQ,  and  then 
perform  adjunction.  But,  we  obviously  have  very  little  to  work  with— just  line  1,  which  is 
a  negation.  In  such  a  case,  it  is  typically  wise  to  attempt  an  indirect  proof.  Start  with  an 
indirect  proof  of  -iP. 

1.  —  (P  V  Q) 

2.  — P 

3.  P 


We  now  need  to  find  a  contradiction — any  contradiction.  But  there  is  an  obvious  one 
already.  Line  1  says  that  neither  P  nor  Q  is  true.  But  line  3  says  that  P  is  true.  We  must 
make  this  contradiction  explicit  by  finding  a  formula  and  its  denial.  We  can  do  this  using 
addition. 

_I(P  V  Q) 

2.  — P 

3.  P 

4.  (PvQ) 

5.  -(PVQ) 

-P 

To  complete  the  proof,  we  will  use  this  strategy  again. 


assumption  for  conditional  derivation 
assumption  for  indirect  derivation 

double  negation,  2 
addition,  3 
repeat,  1 

indirect  derivation  2-5 


assumption  for  conditional  derivation 
assumption  for  indirect  derivation 
double  negation,  2 
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1.  ~'(P  V  Q) 


assumption  for  conditional  derivation 


2.  — P 

3.  P 

4.  (PvQ) 

5.  — (P  V  Q) 
6.  -P 

I  7.  — Q 


assumption  for  indirect  derivation 

double  negation,  2 
addition,  3 
repeat,  1 

indirect  derivation  2-5 
assumption  for  indirect  derivation 


8.  Q 

9.  (PVQ) 

10.  -(PVQ) 

11.  -Q 

12.  (— P  A  —  Q) 

13.  (-(PVQ)  ->  (-PA-Q)) 


double  negation,  7 
addition,  8 
repeat,  1 

indirect  derivation  7-10 
adjunction,  6,  11 
conditional  derivation  1-12 


We  will  prove  De  Morgan’s  theorems  as  problems  for  chapter  9. 

Here  is  a  general  rule  of  thumb  for  doing  proofs:  When  proving  a  conditional,  always 
do  conditional  derivation;  otherwise,  try  direct  derivation;  if  that  fails,  then,  try  indirect 
derivation. 


1. 


a. 

b. 


c. 

d. 


e. 

f. 


2. 


a. 

b. 


c. 

d. 


3. 


8.4  Problems 


Complete  the  following  proofs.  Each  will  require  an  indirect  derivation.  The  last  two 
are  challenging. 

Premises:  (P->R),  (Q-R),  (PvQ).  Conclusion:  R. 

Premises:  ((PvQ)-fR),  -R.  Conclusion:  -iP. 

Premise:  (-.PA-.Q),  Conclusion:  -(PvQ). 

Premise:  (P-R),  (Q-S),  ->(R  A  S).  Conclusion:  -(P  A  Q)- 
Premise:  ->R,  ((P^R)  v  (Q^R)).  Conclusion:  (iP  v  iQ). 

Premise:  -(R  v  S),  (P-R),  (Q-S).  Conclusion:  -(PvQ). 

Prove  the  following  are  theorems. 

-.(PA-.P), 

-((P— P)A(-P-P)). 

(— P- >— (paQ))* 

((PA-Q)— (P-Q)). 


In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises.  Your  argument  should  just  be  a  paragraph  (not  an  ordered  list  of  sentences  or 
anything  else  that  looks  like  formal  logic).  Translate  it  into  propositional  logic  and 
prove  it  is  valid  using  an  indirect  derivation. 
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[10]  This  translation  of  the  title  of  Galileo’s  book  has  become  the  most  common,  although  a 
more  literal  one  would  have  been  Mathematical  Discourses  and  Demonstrations.  Translations 
of  the  book  include  Drake  (1974). 


9. "...  if  and  only  if...",  Using 
Theorems 


9.1  A  historical  example 

The  philosopher  David  Hume  (1711-1776)  is  remembered  for  being  a  brilliant  skeptical 
empiricist.  A  person  is  a  skeptic  about  a  topic  if  that  person  both  has  very  strict  standards  for 
what  constitutes  knowledge  about  that  topic  and  also  believes  we  cannot  meet  those  strict 
standards.  Empiricism  is  the  view  that  we  primarily  gain  knowledge  through  experience, 
particular  experiences  of  our  senses.  In  his  book.  An  Inquiry  Concerning  Human 
Understanding  Hume  lays  out  his  principles  for  knowledge,  and  then  advises  us  to  clean  up 
our  libraries: 

When  we  run  over  libraries,  persuaded  of  these  principles,  what  havoc  must  we 
make?  If  we  take  in  our  hand  any  volume  of  divinity  or  school  metaphysics,  for 
instance,  let  us  ask.  Does  it  contain  any  abstract  reasoning  concerning  quantity  or 
number?  No.  Does  it  contain  any  experimental  reasoning  concerning  matter  of  fact 
and  existence?  No.  Commit  it  then  to  the  flames,  for  it  can  contain  nothing  but 
sophistry  and  illusion. [11] 

Hume  felt  that  the  only  sources  of  knowledge  were  logical  or  mathematical  reasoning 
(which  he  calls  above  “abstract  reasoning  concerning  quantity  or  number”)  or  sense 
experience  (“experimental  reasoning  concerning  matter  of  fact  and  existence”).  Hume  is 
led  to  argue  that  any  claims  not  based  upon  one  or  the  other  method  is  worthless. 

We  can  reconstruct  Hume’s  argument  in  the  following  way.  Suppose  t  is  some  topic  about 
which  we  claim  to  have  knowledge.  Suppose  that  we  did  not  get  this  knowledge  from 
experience  or  logic.  Written  in  English,  we  can  reconstruct  his  argument  in  the  following 
way: 


We  have  knowledge  about  t  if  and  only  if  our  claims  about  tare  learned  from 
experimental  reasoning  or  from  logic  or  mathematics. 

Our  claims  about  t  are  not  learned  from  experimental  reasoning. 

Our  claims  about  t  are  not  learned  from  logic  or  mathematics. 


We  do  not  have  knowledge  about  t. 

What  does  that  phrase  “if  and  only  if”  mean?  Philosophers  think  that  it,  and  several 
synonymous  phrases,  are  used  often  in  reasoning.  Leaving  “if  and  only”  unexplained  for 
now,  we  can  use  the  following  translation  key  to  write  up  the  argument  in  a  mix  of  our 
propositional  logic  and  English. 
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P:  We  have  knowledge  about  t. 

Q:  Our  claims  about  t  are  learned  from  experimental  reasoning. 
R:  Our  claims  about  t  are  learned  from  logic  or  mathematics. 
And  so  we  have: 

P  if  and  only  if  (QvR) 

-Q 

-.R 


-.P 

Our  task  is  to  add  to  our  logical  language  an  equivalent  to  “if  and  only  if”.  Then  we  can 
evaluate  this  reformulation  of  Hume’s  argument. 

9.2  The  biconditional 

Before  we  introduce  a  symbol  synonymous  with  “if  and  only  if”,  and  then  lay  out  its  syntax 
and  semantics,  we  should  start  with  an  observation.  A  phrase  like  “P  if  and  only  if  Q” 
appears  to  be  an  abbreviated  way  of  saying  “P  if  Q  and  P  only  if  Q”.  Once  we  notice 
this,  we  do  not  have  to  try  to  discern  the  meaning  of  “if  and  only  if”  using  our  expert 
understanding  of  English.  Instead,  we  can  discern  the  meaning  of  “if  and  only  if”  using  our 
already  rigorous  definitions  of  “if”,  “and”,  and  “only  if”.  Specifically,  “P  if  Q  and  P  only 
if  Q”  will  be  translated  “((Q^P)A(P^Q))”.  (If  this  is  unclear  to  you,  go  back  and  review 
section  2.2.)  Now,  let  us  make  a  truth  table  for  this  formula. 


p 

Q 

(Q-p) 

(P- 

♦Q) 

((Q-P)A(P-Q)) 

T 

T 

T 

T 

T 

T 

F 

T 

F 

F 

F 

T 

F 

T 

F 

F 

F 

T 

T 

T 

We  have  settled  the  semantics  for  “if  and  only  if”.  We  can  now  introduce  a  new  symbol 
for  this  expression.  It  is  traditional  to  use  the  double  arrow,  We  can  now  express  the 
syntax  and  semantics  of 

If  $  and  VP  are  sentences,  then 

(5>~ip) 


is  a  sentence.  This  kind  of  sentence  is  typically  called  a  “biconditional”. 
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The  semantics  is  given  by  the  following  truth  table. 


<t> 

W 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

T 

One  pleasing  result  of  our  account  of  the  biconditional  is  that  it  allows  us  to  succinctly 
explain  the  syntactic  notion  of  logical  equivalence.  We  say  that  two  sentences  $  and  T  are 
“equivalent”  or  “logically  equivalent”  if  (<P^VP)  is  a  theorem. 

9.3  Alternative  phrases 

In  English,  it  appears  that  there  are  several  phrases  that  usually  have  the  same  meaning  as 
the  biconditional.  Each  of  the  following  sentences  would  be  translated  as  (P^Q). 

P  if  and  only  if  Q. 

P  just  in  case  Q. 

P  is  necessary  and  sufficient  for  Q. 

P  is  equivalent  to  Q. 

9.4  Reasoning  with  the  biconditional 

ELow  can  we  reason  using  a  biconditional?  At  first,  it  would  seem  to  offer  little  guidance.  If 
I  know  that  (P^Q),  I  know  that  P  and  Q  have  the  same  truth  value,  but  from  that  sentence 
alone  I  do  not  know  if  they  are  both  true  or  both  false.  Nonetheless,  we  can  take  advantage 
of  the  semantics  for  the  biconditional  to  observe  that  if  we  also  know  the  truth  value  of 
one  of  the  sentences  constituting  the  biconditional,  then  we  can  derive  the  truth  value  of 
the  other  sentence.  This  suggests  a  straightforward  set  of  rules.  These  will  actually  be  four 
rules,  but  we  will  group  them  together  under  a  single  name,  “equivalence”: 

(<P~T) 

$ 


W 


and 
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(d>^L) 

W 


$ 


and 


-.W 


and 


(d)«W) 

-.W 


What  if  we  instead  are  trying  to  show  a  biconditional?  Here  we  can  return  to  the  insight 
that  the  biconditional  is  equivalent  to  (($^vI/)A(vI/->$)).  If  we  could  prove  both 

(5>^)  and  (^$)  ,  we  will  know  that  (<t>^'F)  must  be  true. 

We  can  call  this  rule  “bicondition”.  It  has  the  following  form: 

(^ip) 


(3>~q/) 

This  means  that  often  when  we  aim  to  prove  a  biconditional,  we  will  undertake  two 
conditional  derivations  to  derive  two  conditionals,  and  then  use  the  bicondition  rule.  That 
is,  many  proofs  of  biconditionals  have  the  following  form: 
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>  (<p  ->■  'ip) 

(p 

>  {$-+<!>) 

{<P  ^  VO 


9.5  Returning  to  Hume 

We  can  now  see  if  we  are  able  to  prove  Hume’s  argument.  Given  now  the  new 
biconditional  symbol,  we  can  begin  a  direct  proof  with  our  three  premises. 


1.  (P  o  (Q  V  R)) 

2.  - iQ 

3.  - iR 


premise 

premise 

premise 


We  have  already  observed  that  we  think  (QvR)  is  false  because  -iQ  and  ->R.  So  let’s  prove 
-(QvR).  This  sentence  cannot  be  proved  directly,  given  the  premises  we  have;  and  it 
cannot  be  proven  with  a  conditional  proof,  since  it  is  not  a  conditional.  So  let’s  try  an 
indirect  proof.  We  believe  that  -i(QvR)  is  true,  so  well  assume  the  denial  of  this  and  show 
a  contradiction. 


1.  (P  (QVR)) 

2.  ~ iQ 

3.  “i R 


premise 

premise 

premise 


4.  -.-.(QVR) 

5.  (Q  V  R) 

6.  R 

7.  -.R 

8.  — . (Q  V  R) 

9.  —i  P 


assumption  for  indirect  derivation 

double  negation,  4 
modus  tollendo  ponens,  5,  2 
repetition,  3 
indirect  proof,  4-7 
equivalence,  1,  8 
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Hume’s  argument,  at  least  as  we  reconstructed  it,  is  valid. 

Is  Hume’s  argument  sound?  Whether  it  is  sound  depends  upon  the  first  premise  above 
(since  the  second  and  third  premises  are  abstractions  about  some  topic  t).  Most  specifically, 
it  depends  upon  the  claim  that  we  have  knowledge  about  something  just  in  case  we  can 
show  it  with  experiment  or  logic.  Hume  argues  we  should  distrust — indeed,  we  should 
burn  texts  containing — claims  that  are  not  from  experiment  and  observation,  or  from  logic 
and  math.  But  consider  this  claim:  we  have  knowledge  about  a  topic  t  if  and  only  if 
our  claims  about  t  are  learned  from  experiment  or  our  claims  about  t  are  learned  from  logic 
or  mathematics. 

Did  Hume  discover  this  claim  through  experiments?  Or  did  he  discover  it  through  logic? 
What  fate  would  Hume’s  book  suffer,  if  we  took  his  advice? 

9.6  Some  examples 

It  can  be  helpful  to  prove  some  theorems  that  make  use  of  the  biconditional,  in  order  to 
illustrate  how  we  can  reason  with  the  biconditional. 

Here  is  a  useful  principle.  If  two  sentences  have  the  same  truth  value  as  a  third  sentence, 
then  they  have  the  same  truth  value  as  each  other.  We  state  this  as 
(((P~Q)*(R~Q)HP~R)).  To  illustrate  reasoning  with  the  biconditional,  let  us  prove  this 
theorem. 


This  theorem  is  a  conditional,  so  it  will  require  a  conditional  derivation.  The  consequent 
of  the  conditional  is  a  biconditional,  so  we  will  expect  to  need  two  conditional  derivations, 
one  to  prove  (P-R)  and  one  to  prove  (R-fP).  The  proof  will  look  like  this.  Study  it 
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_1.  ((PoQ)  a(RhQ)) 

2.  (P  ++  Q) 

3.  (R  o  Q) 

4.  P 

5.  Q 

6.  R 

7.  (P  ->  R) 

I  8.  R 

9.  Q 

10.  R 

11.  (R  -*  P) 

12.  (P  o  R) 

13.  (((P  ^  Q)  A  (R  <4  Q))  -4  (P  <4  R)) 


assumption  for  conditional  derivation 

simplification,  1 
simplification,  1 

assumption  for  conditional  derivation 

equivalence,  2,  4 
equivalence,  3,  5 

conditional  derivation,  4-6 
assumption  for  conditional  derivation 

equivalence,  3,  8 
equivalence,  2,  9 

conditional  derivation,  8-10 
bincondition,  7,  11 

conditional  derivation  1-12 


We  have  mentioned  before  the  principles  that  we  associate  with  the  mathematician 
Augustus  De  Morgan  (1806-1871),  and  which  today  are  called  “De  Morgan’s  Laws”  or 
the  “De  Morgan  Equivalences”.  These  are  the  recognition  that  i(PvQ)  and  (-.PA-.Q)  are 
equivalent,  and  also  that  i(PAQ)  and  (iPv-iQ)  are  equivalent.  We  can  now  express  these 
with  the  biconditional.  The  following  are  theorems  of  our  logic: 

HPvQHM>a^Q)) 

HPaQHM>v^Q)) 

We  will  prove  the  second  of  these  theorems.  This  is  perhaps  the  most  difficult  proof  we 
have  seen;  it  requires  nested  indirect  proofs,  and  a  fair  amount  of  cleverness  in  finding  what 
the  relevant  contradiction  will  be. 


106  A  Concise  Introduction  to  Logic 


1.  ->(P  A  Q) 

2.  — >(— >P  V  -iQ) 

I  3.  -P 

4.  (-P  V-Q) 

5.  — (— P  V  — Q) 

6.  P 


8.  (-P  V-Q) 

9.  -(-P  V-Q) 

10.  Q 

11.  (PAQ) 

12.  -(PAQ) 

13.  (— P  V  — Q) 

14.  (— '(P  A  Q)  — >  (— P  V  — Q)) 
15.  (-P  V-Q) 

I  16.  —(PAQ) 

17.  (PAQ) 

18.  P 

19.  — P 

20.  -Q 

21.  Q 

22.  -(PAQ) 

23.  ((-PV-Q)  ->  -(PAQ)) 

24.  (— (P  A  Q)  (P  V  — Q)) 


assumption  for  conditional  derivation 
assumption  for  indirect  derivation 
assumption  for  indirect  derivation 

addition,  3 
repeat,  2 

indirect  derivation,  3-5 
assumption  for  indirect  derivation 

addition,  7 
repeat,  2 

indirect  derivation,  6-9 
adjunction,  6,  10 
repeat,  1 

indirect  derivation,  2-12 

conditional  derivation,  1-13 
assumption  for  conditional  derivation 

assumption  for  indirect  derivation 

double  negation  16 
simplification,  17 
double  negation,  18 
modus  tollendo  ponens,  15,  19 
simplification,  17 

indirect  derivation  16-21 

conditional  derivation,  15-22 
bicondition,  14,  23 


9.7  Using  theorems 

Every  sentence  of  our  logic  is,  in  semantic  terms,  one  of  three  kinds.  It  is  either  a  tautology, 
a  contradictory  sentence,  or  a  contingent  sentence.  We  have  already  defined  “tautology”  (a 
sentence  that  must  be  true)  and  “contradictory  sentence”  (a  sentence  that  must  be  false).  A 
contingent  sentence  is  a  sentence  that  is  neither  a  tautology  nor  a  contradictory  sentence. 
Thus,  a  contingent  sentence  is  a  sentence  that  might  be  true,  or  might  be  false. 
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Here  is  an  example  of  each  kind  of  sentence: 

(Pv-iP) 

(P~-.P) 

P 

The  first  is  a  tautology,  the  second  is  a  contradictory  sentence,  and  the  third  is  contingent. 
We  can  see  this  with  a  truth  table. 


p 

-.P 

(Pv-.P) 

(P— P) 

P 

T 

F 

T 

F 

T 

F 

T 

T 

F 

F 

Notice  that  the  negation  of  a  tautology  is  a  contradiction,  the  negation  of  a  contradiction 
is  a  tautology,  and  the  negation  of  a  contingent  sentence  is  a  contingent  sentence. 

-■(Pv-iP) 

i(P«-.P) 


-.P 


P 

-.P 

(Pv->P) 

-.(Pv-.P) 

(P—P) 

-(P— P) 

T 

F 

T 

F 

F 

T 

F 

T 

T 

F 

F 

T 

A  moment’s  reflection  will  reveal  that  it  would  be  quite  a  disaster  if  either  a  contradictory 
sentence  or  a  contingent  sentence  were  a  theorem  of  our  propositional  logic.  Our  logic 
was  designed  to  produce  only  valid  arguments.  Arguments  that  have  no  premises,  we 
observed,  should  have  conclusions  that  must  be  true  (again,  this  follows  because  a  sentence 
that  can  be  proved  with  no  premises  could  be  proved  with  any  premises,  and  so  it  had  better 
be  true  no  matter  what  premises  we  use).  If  a  theorem  were  contradictory,  we  would  know 
that  we  could  prove  a  falsehood.  If  a  theorem  were  contingent,  then  sometimes  we  could 
prove  a  falsehood  (that  is,  we  could  prove  a  sentence  that  is  under  some  conditions  false). 

And,  given  that  we  have  adopted  indirect  derivation  as  a  proof  method,  it  follows  that 
once  we  have  a  contradiction  or  a  contradictory  sentence  in  an  argument,  we  can  prove 
anything. 

Theorems  can  be  very  useful  to  us  in  arguments.  Suppose  we  know  that  neither  Smith  nor 
Jones  will  go  to  London,  and  we  want  to  prove,  therefore,  that  Jones  will  not  go  to  London. 

If  we  allowed  ourselves  to  use  one  of  De  Morgan’s  theorems,  we  could  make  quick  work 
of  the  argument.  Assume  the  following  key. 


P:  Smith  will  go  to  London. 
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Q:  Jones  will  go  to  London. 
And  we  have  the  following  argument: 

I  1.  “ 1  (P  V  Q) 


2.  (— i(P  V  Q)  (“i P  A  “ 1 Q)) 

3.  (~ >P  A  -iQ) 

4. 


premise 


theorem 
equivalence,  2,  1 
simplification,  3 


This  proof  was  made  very  easy  by  our  use  of  the  theorem  at  line  2. 

There  are  two  things  to  note  about  this.  First,  we  should  allow  ourselves  to  do  this,  because 
if  we  know  that  a  sentence  is  a  theorem,  then  we  know  that  we  could  prove  that  theorem 
in  a  subproof.  That  is,  we  could  replace  line  2  above  with  a  long  subproof  that  proves  (i(P 
v  QH-.P  A  iQ)),  which  we  could  then  use.  But  if  we  are  certain  that  HP  v  QH-.P A 
-Q))  is  a  theorem,  we  should  not  need  to  do  this  proof  again  and  again,  each  time  that  we 
want  to  make  use  of  the  theorem. 

The  second  issue  that  we  should  recognize  is  more  subtle.  There  are  infinitely  many 
sentences  of  the  form  of  our  theorem,  and  we  should  be  able  to  use  those  also.  For  example, 
the  following  sentences  would  each  have  a  proof  identical  to  our  proof  of  the  theorem  (-i(P 
v  Q)<-»(-iP  A  -iQ)),  except  that  the  letters  would  be  different: 

HR  vS)«  (->R  a  -.S)) 

HT  vU)«  (-.T  a  --U)) 

(-(V  v  W)  ~  (-.V  A  -.W)) 

This  is  hopefully  obvious.  Take  the  proof  of  (i(P  v  Q)<-»(-iP  A  -iQ)),  and  in  that  proof 
replace  each  instance  of  P  with  R  and  each  instance  of  Q  with  S,  and  you  would  have  a 
proof  of  (i(R  v  S)H--R  A  _,S)). 

But  here  is  something  that  perhaps  is  less  obvious.  Each  of  the  following  can  be  thought  of 
as  similar  to  the  theorem  HPvQH-PA-Q)). 

H(paQ)  v  (ras))H-i(paq)  a  -(Ras))) 

HT  V  (Q  V  V)H-.T  A  ->(Q  V  v)) 

K(Q~P)  V  (-R— ■  Q)HHQ~P)  A  -i(-iR-»-iQ))) 


For  example,  if  one  took  a  proof  of  (-i(P  v  QHH*  A  _1Q))  and  replaced  each  initial  instance 
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of  P  with  (Q<-»P)  and  each  initial  instance  of  Q  with  (-iR->-iQ),  then  one  would  have  a 
proof  of  the  theorem  H(Q^P)  v  (-iR-»-iQ))<-»(-i(Q<-»P)  A  -i(-iR-»-iQ))). 

We  could  capture  this  insight  in  two  ways.  We  could  state  theorems  of  our  metalanguage 
and  allow  that  these  have  instances.  Thus,  we  could  take  (i($  v  VF)  <-»  (-1$  A  -AP))  as  a 
metalanguage  theorem,  in  which  we  could  replace  each  $  with  a  sentence  and  each  W  with 
a  sentence  and  get  a  particular  instance  of  a  theorem.  An  alternative  is  to  allow  that  from  a 
theorem  we  can  produce  other  theorems  through  substitution.  For  ease,  we  will  take  this 
second  strategy. 

Our  rule  will  be  this.  Once  we  prove  a  theorem,  we  can  cite  it  in  a  proof  at  any  time.  Our 
justification  is  that  the  claim  is  a  theorem.  We  allow  substitution  of  any  atomic  sentence  in 
the  theorem  with  any  other  sentence  if  and  only  if  we  replace  each  initial  instance  of  that 
atomic  sentence  in  the  theorem  with  the  same  sentence. 

Before  we  consider  an  example,  it  is  beneficial  to  list  some  useful  theorems.  There  are 
infinitely  many  theorems  of  our  language,  but  these  ten  are  often  very  helpful.  A  few  we 
have  proved.  The  others  can  be  proved  as  an  exercise. 

T1  (P  v  -iP) 

T2  HP-Q)  -  (PAiQ)) 

T3  (->(P  vQ)«  (iP  A  ->Q)) 

T4  ((->P  v  iQ)  i(P  A  Q)) 

T5  HP  -  Q)  -  (P  -  -.Q)) 

T6  H  ->  (P  -  Q)) 

T7  (P-(Q-P)) 

T8  ((P-(Q-R))  -  ((P-Q)  -  (P-R))) 

T9  ((iP^iQ)  ->  (H-Q)  -P)) 

T10  ((P-Q)  -  (-.Q— . P)) 

Some  examples  will  make  the  advantage  of  using  theorems  clear.  Consider  a  different 
argument,  building  on  the  one  above.  We  know  that  neither  is  it  the  case  that  if  Smith 
goes  to  London,  he  will  go  to  Berlin,  nor  is  it  the  case  that  if  Jones  goes  to  London  he  will 
go  to  Berlin.  We  want  to  prove  that  it  is  not  the  case  that  Jones  will  go  to  Berlin.  We  add 
the  following  to  our  key: 


R:  Smith  will  go  to  Berlin. 
S:  Jones  will  go  to  Berlin. 
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And  we  have  the  following  argument: 


1. -.((P -►  R)  V  (Q -►  S)) 


premise 


O  (_I((P  -A  R)  V  (Q  -A  S))  «a 

HPAR)An(Q-iS))) 

3.  HP  — >  R)  A  ->(Q  -4  S)) 

4.  -.(Q  -4  S) 

5.  (-.(Q  ^  S)  o  (Q  A  -iS)) 

6.  (Q  A  ■" iS) 

7.  -iS 


theorem  T3 

equivalence,  2,  1 
simplification,  3 
theorem  T2 
equivalence,  5,  4 
simplification,  6 


Using  theorems  made  this  proof  much  shorter  than  it  might  otherwise  be.  Also,  theorems 
often  make  a  proof  easier  to  follow,  since  we  recognize  the  theorems  as  tautologies — as 
sentences  that  must  be  true. 


9.8  Problems 

1.  Prove  each  of  the  following  arguments  is  valid. 

a.  Premises:  P,  -iQ.  Conclusion:  -HQ). 

b.  Premises:  (iPvQ),  (Pv-iQ).  Conclusion:  HQ). 

c.  Premises:  HQ),  Hs)  •  Conclusion:  ((parHqas)). 

2.  Prove  each  of  the  following  theorems. 

a.  T1 

b.  T2 

c.  T5 

d.  T6 

e.  T7 

f.  T8 

g.  T9 

h.  ((PAQ)H-Pv-iQ)) 

i.  (HQHHQ)) 

3.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises,  at  least  one  of  which  is  a  biconditional.  Your  argument  should  just  be  a 
paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like  formal 
logic).  Translate  it  into  propositional  logic  and  prove  it  is  valid. 

4.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises,  and  with  a  conclusion  that  is  a  biconditional  Your  argument  should  just  be  a 
paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  formal  like 
logic).  Translate  it  into  propositional  logic  and  prove  it  is  valid. 
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[l  l]  From  Hume’s  Enquiry  Concerning  Human  Understanding,  p.  161  in  Selby-Bigge  and 
Nidditch  (1995  [1777]). 


1 0.  Summary  of  Propositional  Logic 

10.1  Elements  of  the  language 

•  Principle  of  Bivalence:  each  sentence  is  either  true  or  false,  never  both,  never 
neither. 

•  Each  atomic  sentence  is  a  sentence. 

•  Syntax:  if  <t>  and  VF  are  sentences,  then  the  following  are  also  sentences 

- 

-  (<D-W) 

-  ($A^F) 

-  ($vW) 

-  (5>~^F) 

•  Semantics:  if  $  and  VF  are  sentences,  then  the  meanings  of  the  connectives  are 
fully  given  by  their  truth  tables.  These  truth  tables  are: 


$ 

T 

F 

F 

T 

<t> 

W 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

<5 

W 

($  A 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

F 
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<t> 

W 

(<t>vT) 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

F 

F 

<5 

W 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

T 

•  A  sentence  of  the  propositional  logic  that  must  be  true  is  a  tautology. 

•  A  sentence  that  must  be  false  is  a  contradictory  sentence. 

•  A  sentence  that  is  neither  a  tautology  nor  a  contradictory  sentence  is  a 
contingent  sentence. 

•  Two  sentences  4>  and  VF  are  equivalent,  or  logically  equivalent,  when  is 

a  theorem. 

1 0.2  Reasoning  with  the  language 

•  An  argument  is  an  ordered  list  of  sentences,  one  sentence  of  which  we  call  the 
“conclusion”  and  the  others  of  which  we  call  the  “premises”. 

•  A  valid  argument  is  an  argument  in  which:  necessarily,  if  the  premises  are  true, 
then  the  conclusion  is  true. 

•  A  sound  argument  is  a  valid  argument  with  true  premises. 

•  Inference  rules  allow  us  to  write  down  a  sentence  that  must  be  true,  assuming 
that  certain  other  sentences  are  true.  We  say  that  the  new  sentence  is  “derived 
from”  those  other  sentences  using  the  inference  rule. 

•  Schematically,  we  can  write  out  the  inference  rules  in  the  following  way  (think 
of  these  as  saying,  if  you  have  written  the  sentence(s)  above  the  line,  then  you 
can  write  the  sentence  below  the  line): 
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Modus  ponens 

Modus  tollens 

Double  negation 

Double  negation 

($^T) 

<t> 

($-'F) 

-AF 

<t> 

-1-1$ 

W 

-.$ 

$ 

Addition 

Addition 

Modus  tollendo  ponens 

Modus  tollendo  ponens 

<t> 

W 

4 

< 

2§ 

($vW) 

-AF 

($vT) 

<& 

< 

W 

$ 

Adjunction 

Simplification 

Simplification 

Bicondition 

<t> 

W 

($  A  'F) 

($  AT) 

($->vF) 

(vF-»$) 

(<t>  A  T) 

<t> 

T 

($«T) 

Equivalence 

Equivalence 

Equivalence 

Equivalence 

($~T) 

<5 

($~T) 

W 

($~T) 

-.$ 

($«q/) 

-AF 

W 

<t> 

nf 

-.$ 

•  A  proof  (or  derivation)  is  a  syntactic  method  for  showing  an  argument  is  valid. 

Our  system  has  three  kinds  of  proof  (or  derivation):  direct,  conditional,  and 
indirect. 

•  A  direct  proof  (or  direct  derivation)  is  an  ordered  list  of  sentences  in  which 
every  sentence  is  either  a  premise  or  is  derived  from  earlier  lines  using  an 
inference  rule.  The  last  line  of  the  proof  is  the  conclusion. 

•  A  conditional  proof  (or  conditional  derivation)  is  an  ordered  list  of  sentences  in 
which  every  sentence  is  either  a  premise,  is  the  special  assumption  for 
conditional  derivation,  or  is  derived  from  earlier  lines  using  an  inference  rule.  If 
the  assumption  for  conditional  derivation  is  5>,  and  we  derive  as  some  step  in  the 
proof  VF,  then  we  can  write  after  this  ($^VF)  as  our  conclusion. 

•  An  indirect  proof  (or  indirect  derivation,  and  also  known  as  a  reductio  ad 
absurdum)  is:  an  ordered  list  of  sentences  in  which  every  sentence  is  either  l)  a 
premise,  2)  the  special  assumption  for  indirect  derivation  (also  sometimes  called 
the  “assumption  for  reductio”),  or  3)  derived  from  earlier  lines  using  an 
inference  rule.  If  our  assumption  for  indirect  derivation  is  -■$,  and  we  derive  as 
some  step  in  the  proof  VP  and  also  as  some  step  of  our  proof  -AF,  then  we 
conclude  that  5>. 

•  We  can  use  Fitch  bars  to  write  out  the  three  proof  schemas  in  the  following 
way: 
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>| 

—i  if; 

•  A  sentence  that  we  can  prove  without  premises  is  a  theorem. 

•  Suppose  $  is  a  theorem,  and  it  contains  the  atomic  sentences  Pi. .  .Pn.  If  we 
replace  each  and  every  occurrence  of  one  of  those  atomic  sentences  P;  in  $  with 
another  sentence  W,  the  resulting  sentence  is  also  a  theorem.  This  can  be 
repeated  for  any  atomic  sentences  in  the  theorem. 


Part  II:  First  Order  Logic 


1 1 .  Names  and  predicates 

1 1 .1  A  limitation  of  the  propositional  logic 

The  propositional  logic  is  a  perfect  language  for  what  it  does.  It  is  rigorously  precise 
and  easy  to  use.  But  it  is  not  the  only  kind  of  logic  that  philosophers  developed.  The 
philosopher  Aristotle  (384-322  BC)  wrote  several  books  on  logic,  and  famously,  he  used  the 
following  argument  as  one  of  his  examples. 

All  men  are  mortal. 

Socrates  is  a  man. 


Socrates  is  mortal. 

Aristotle  considered  this  an  example  of  a  valid  argument.  And  it  appears  to  be  one.  But 
let  us  translate  it  into  our  propositional  logic.  We  have  three  atomic  sentences.  Our 
translation  key  would  look  something  like  this: 

P:  All  men  are  mortal. 

Q:  Socrates  is  a  man. 

R:  Socrates  is  mortal. 

And  the  argument,  written  in  propositional  logic,  would  be 

P 

Q 


R 

This  argument  is  obviously  invalid.  What  went  wrong?  Somehow,  between  Aristotle’s 
argument  and  our  translation,  essential  information  was  lost.  This  information  was 
required  in  order  for  the  argument  to  be  valid.  When  we  lost  it,  we  ended  up  with  an 
argument  where  the  conclusion  could  be  false  (as  far  as  we  can  tell  from  the  shape  of  the 
argument  alone). 

It  seems  quite  clear  what  we  lost  in  the  translation.  There  are  parts  of  the  first  premise  that 
are  shared  by  the  other  two:  something  to  do  with  being  a  man,  and  being  mortal.  There 
is  a  part  of  the  second  sentence  shared  with  the  conclusion:  the  proper  name  “Socrates”. 
And  the  word  “All”  seems  to  be  playing  an  important  role  here. 
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Note  that  all  three  of  these  things  (those  adjective  phrases,  a  proper  name,  and  “all”)  are 
themselves  not  sentences.  To  understand  this  argument  of  Aristotle’s,  we  will  need  to  break 
into  the  atomic  sentences,  and  begin  to  understand  their  parts.  Doing  this  proved  to  be 
very  challenging — most  of  all,  making  sense  of  that  “all”  proved  challenging.  As  a  result, 
for  nearly  two  thousand  years,  we  had  two  logics  working  in  parallel:  the  propositional 
logic  and  Aristotle’s  logic.  It  was  not  until  late  in  the  nineteenth  century  that  we  developed 
a  clear  and  precise  understanding  of  how  to  combine  these  two  logics  into  one,  which  we 
will  call  “first  order  logic”  (we  will  explain  later  what  “first  order”  means). 

Our  task  will  be  to  make  sense  of  these  parts:  proper  names,  adjective  phrases,  and  the  “all”. 
We  can  begin  with  names. 

11.2  Symbolic  terms:  proper  names 

The  first  thing  we  want  to  add  to  our  expanded  language  are  names.  We  will  take  proper 
names  (such  as,  “Abraham  Lincoln”)  as  our  model.  General  names  (such  as  “Americans”) 
we  will  handle  in  a  different  way,  to  be  discussed  later.  We  will  call  these  proper  names  of 
our  language,  “names”. 

Recall  that  we  want  our  language  to  have  no  vagueness,  and  no  ambiguity.  A  name  would 
be  vague  if  it  might  or  might  not  pick  out  an  object.  So  we  will  require  that  each  name  pick 
out  an  object.  That  is,  a  name  may  not  be  added  to  our  language  if  it  refers  to  nothing,  or 
only  refers  to  something  under  some  conditions.  A  name  would  be  ambiguous  if  it  pointed 
at  more  than  one  thing.  “John  Smith”  is  a  name  that  points  at  thousands  of  people.  We 
will  not  allow  this  in  our  language.  Each  name  points  at  only  one  thing. 

We  might  decide  also  that  each  thing  that  our  language  talks  about  has  only  one  name. 
Some  philosophers  have  thought  that  such  a  rule  would  be  very  helpful.  However,  it  turns 
out  it  is  often  very  hard  to  know  if  two  apparent  things  are  the  same  thing,  and  so  in  a 
natural  language  we  often  have  several  names  for  the  same  thing.  A  favorite  example  of 
philosophers,  taken  from  the  philosopher  and  mathematician  Gottlob  Frege  (1848-1925),  is 
“Hesperus”  and  “Phosphorus.”  These  are  both  names  for  Venus,  although  some  who  used 
these  names  did  not  know  that.  Thus,  for  a  while,  some  people  did  not  know  that  Hesperus 
was  Phosphorus.  And,  of  course,  we  would  not  have  been  able  to  use  just  one  name  for 
both,  if  we  did  not  know  that  these  names  pointed  at  the  same  one  thing.  Thus,  if  we  want 
to  model  scientific  problems,  or  other  real  world  problems,  using  our  logic,  then  a  rule  that 
each  thing  have  one  and  only  one  name  would  demand  too  much:  it  would  require  us  to 
solve  all  our  mysteries  before  we  got  started.  In  any  case,  there  is  no  ambiguity  in  a  thing 
having  several  names. 

Names  refer  to  things.  But  when  we  say  a  refers  to  such  and  such  an  object,  then  if 
someone  asked,  “What  do  you  mean  by  ‘refer’?”,  we  would  be  hard  pressed  to  do  anything 
more  than  offer  a  list  of  synonyms:  a  points  at  the  object,  a  names  the  object,  a  indicates 
the  object,  a  picks  out  the  object.  “Refer”  is  another  primitive  that  we  are  adding  to 
our  language.  We  cannot  in  this  book  explain  what  reference  is;  in  fact,  philosophers 
vigorously  debate  this  today,  and  there  are  several  different  and  (seemingly)  incompatible 
theories  about  how  names  work.  However,  taking  “refer”  as  a  primitive  will  not  cause  us 
difficulties,  since  we  all  use  names  and  so  we  all  have  a  working  understanding  of  names 
and  how  they  refer. 


11.  Names  and  predicates  121 


In  our  language,  we  will  use  lower  case  letters,  from  the  beginning  of  the  alphabet,  for 
names.  Thus,  the  following  are  names: 

a 

b 

c 


In  a  natural  language,  there  is  more  meaning  to  a  name  than  what  it  points  at.  Gottlob 
Frege  was  intrigued  by  the  following  kinds  of  cases. 


a=a 


a=b 


Hesperus  is  Hesperus. 

Hesperus  is  Phosphorus. 

What  is  peculiar  in  these  four  sentences  is  that  the  first  and  third  are  trivial.  We  know  that 
they  must  be  true.  The  second  and  fourth  sentences,  however,  might  be  surprising,  even 
if  true.  Frege  observed  that  reference  cannot  constitute  all  the  meaning  of  a  name,  for  if  it 
did,  and  if  a  is  b ,  then  the  second  sentence  above  should  have  the  same  meaning  as  the  first 
sentence.  And,  if  Hesperus  is  Phosphorus,  the  third  and  fourth  sentences  should  have  the 
same  meaning.  But  obviously  they  don’t.  The  meaning  of  a  name,  he  concluded,  is  more 
than  just  what  it  refers  to.  He  called  this  extra  meaning  sense  (Sinn,  in  his  native  German). 

We  won’t  be  able  to  explore  these  subtleties.  We’re  going  to  reduce  the  meaning  of  our 
names  down  to  their  referent.  This  is  another  case  where  we  see  that  a  natural  language 
like  English  is  very  powerful,  and  contains  subtleties  that  we  avoid  and  simplify  away  in 
order  to  develop  our  precise  language. 

Finally,  let  us  repeat  that  we  are  using  the  word  “name”  in  a  very  specific  sense.  A  name 
picks  out  a  single  object.  For  this  reason,  although  it  may  be  true  that  “cat”  is  a  kind  of 
name  in  English,  it  cannot  be  properly  translated  to  a  name  in  our  logical  language.  Thus, 
when  considering  whether  some  element  of  a  natural  language  is  a  proper  name,  just  ask 
yourself:  is  there  a  single  thing  being  referred  to  by  this  element?  If  the  answer  is  no,  then 
that  part  of  the  natural  language  is  not  like  a  name  in  our  logical  language. 

11.3  Predicates 

Another  element  of  Aristotle’s  argument  above  that  we  want  to  capture  is  phrases  like  “is  a 
man”  and  “is  mortal”.  These  adjective  phrases  are  called  by  philosophers  “predicates”.  They 
tell  us  about  properties  or  relations  had  by  the  things  our  language  is  about.  In  our  sentence 
“Socrates  is  a  man”,  the  predicate  (“. . .  is  a  man”)  identifies  a  property  of  Socrates.  We  want 
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to  introduce  into  our  logical  language  a  way  to  express  these  predicates.  But  before  we  do 
this,  we  need  to  clarify  how  predicates  relate  to  the  objects  we  are  talking  about,  and  we 
want  to  be  sure  that  we  introduce  predicates  in  such  a  way  that  their  meaning  is  precise 
(they  are  not  vague  or  ambiguous). 

Our  example  of  “ . . .  is  a  man”  might  lead  us  to  think  that  predicates  identify  properties  of 
individual  objects.  But  consider  the  following  sentences. 

Tom  is  tall. 

Tom  is  taller  than  Jack. 

7  is  odd. 

7  is  greater  than  or  equal  to  5. 

The  first  and  third  sentence  are  quite  like  the  ones  we’ve  seen  before.  “Tom”  and  “7”  are 
names.  And  “. .  .is  tall”  and  “. .  .is  odd”  are  predicates.  These  are  similar  (at  least  in  terms  of 
their  apparent  syntax)  to  “Socrates”  and  “. . .  is  a  man”. 

But  what  about  those  other  two  sentences?  The  predicates  in  these  sentences  express 
relations  between  two  things.  And,  although  in  English  it  is  rare  that  a  predicate  expresses 
a  relation  of  more  than  two  things,  in  our  logical  language  a  predicate  could  identify  a 
relation  between  any  number  of  things.  We  need,  then,  to  be  aware  that  each  predicate 
identifies  a  relation  between  a  specific  number  of  things.  This  is  important,  because  the 
predicates  in  the  first  and  second  sentence  above  are  not  the  same.  That  is,  “...is  tall”  and 
“. . .  is  taller  than. . .”  are  not  the  same  predicate. 

Logicians  have  a  slang  for  this;  they  call  it  the  “arity”  of  the  predicate.  This  odd  word  comes 
from  taking  the  “ary”  on  words  like  “binary”  and  “trinary”,  and  making  it  into  a  noun. 
So,  we  can  say  the  following:  each  predicate  has  an  arity.  The  arity  of  a  predicate  is  the 
minimum  number  of  things  that  can  have  the  property  or  relation.  The  predicate  "...  is 
tall”  is  arity  one.  One  thing  alone  can  be  tall.  The  predicate  "...  is  taller  than...”  is  arity 
two.  You  need  at  least  two  things  for  one  to  be  taller  than  the  other. 

Thus,  consider  the  following  sentence. 

Stefano,  Margarita,  Aletheia,  and  Lorena  are  Italian. 

There  is  a  predicate  here,  "...  are  Italian.”  It  has  been  used  to  describe  four  things.  Is 
it  an  arity  four  predicate?  We  could  treat  it  as  one,  but  that  would  make  our  language 
deceptive.  Our  test  should  be  the  following  principle:  what  is  the  minimum  number  of 
things  that  can  have  that  property  or  relation?  In  that  case,  "...  are  Italian”  should  be  an 
arity  one  predicate  because  one  thing  alone  can  be  Italian.  Thus,  the  sentence  above  should 
be  understood  as  equivalent  to: 

Stefano  is  Italian  and  Margarita  is  Italian  and  Aletheia  is  Italian  and  Lorena  is 
Italian. 
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This  is  formed  using  conjunctions  of  atomic  sentences,  each  containing  the  same  arity  one 
predicate.  Consider  also  the  following  sentence. 

Stefano  is  older  than  Aletheia  and  Lorena. 

There  are  three  names  here.  Is  the  predicate  then  an  arity  three  predicate?  No.  The 
minimum  number  of  things  such  that  one  can  be  older  than  the  other  is  two.  From  this 
fact,  we  know  that  “...  is  older  than...”  is  an  arity  two  predicate.  This  sentence  is  thus 
equivalent  to: 

Stefano  is  older  than  Aletheia  and  Stefano  is  older  than  Lorena. 

This  is  formed  using  a  conjunction  of  atomic  sentences,  each  containing  the  same  arity  two 
predicate. 

Note  an  important  difference  we  need  to  make  between  our  logical  language  and  a  natural 
language  like  English.  In  a  natural  language  like  English,  we  have  a  vast  range  of  kinds  of 
names  and  kinds  of  predicates.  Some  of  these  could  be  combined  to  form  sentences  without 
any  recognizable  truth  value.  Consider: 

Jupiter  is  an  odd  number. 

America  is  taller  than  Smith. 

7  is  older  than  Jones. 

These  expressions  are  semantic  nonsense,  although  they  are  syntactically  well  formed.  The 
predicate  “. .  .is  an  odd  number”  cannot  be  true  or  false  of  a  planet.  America  does  not  have 
a  height  to  be  compared.  Numbers  do  not  have  an  age.  And  so  on. 

We  are  very  clever  speakers  in  our  native  languages.  We  naturally  avoid  these  kinds  of 
mistakes  (most  of  the  time).  But  our  logic  is  being  built  to  avoid  such  mistakes  always;  it 
aims  to  make  them  impossible.  Thus,  each  first  order  logical  language  must  have  what  we 
will  call  its  “domain  of  discourse”.  The  domain  of  discourse  is  the  set  of  things  that  our 
first  order  logic  is  talking  about.  If  we  want  to  talk  about  numbers,  people,  and  nations,  we 
will  want  to  make  three  different  languages  with  three  different  sets  of  predicates  and  three 
different  domains  of  discourse. 

We  can  now  state  our  rule  for  predicates  precisely.  A  predicate  of  arity  n  must  be  true  or 
false,  never  both,  never  neither,  of  each  n  objects  from  our  domain  of  discourse. 

This  will  allow  us  to  avoid  predicates  that  are  vague  or  ambiguous.  A  vague  predicate 
might  include,  “. .  .is  kind  of  tall.”  It  might  be  obviously  false  of  very  short  people,  but  it  is 
not  going  to  have  a  clear  truth  value  with  respect  to  people  who  are  of  height  slightly  above 
average.  If  a  predicate  were  ambiguous,  we  would  again  not  be  able  to  tell  in  some  cases 
whether  the  predicate  were  true  or  false  of  some  of  the  things  in  our  domain  of  discourse. 
An  example  might  include,  "...  is  by  the  pen.”  It  could  mean  is  by  the  writing  implement,  or 
it  could  mean  is  by  the  children’s  playpen.  Not  knowing  which,  we  would  not  be  able  to 
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tell  whether  a  sentence  like  “Fido  is  by  the  pen”  were  true  or  false.  Our  rule  for  predicates 
explicitly  rules  out  either  possibility. 

When  we  say,  “a  predicate  of  arity  n  is  true  or  false  of  each  n  objects  from  our  domain  of 
discourse”,  what  we  mean  is  that  an  arity  one  predicate  must  be  true  or  false  of  each  thing 
in  the  domain  of  discourse;  and  an  arity  two  predicate  must  be  true  or  false  of  every  possible 
ordered  pair  of  things  from  the  domain  of  discourse;  and  an  arity  three  predicate  must  be 
true  or  false  of  every  possible  ordered  triple  of  things  from  our  domain  of  discourse;  and  so 
on. 

We  will  use  upper  case  letters  from  F  on  to  represent  predicates  of  our  logical  language. 
Thus, 


F 

G 

H 

I 

J 

K 


are  predicates. 


11.4  First  order  logic  sentences 

We  can  now  explain  what  a  sentence  is  in  our  first  order  logic. 

We  need  to  decide  how  names  and  predicates  will  be  combined.  Different  methods  have 
been  used,  but  most  common  is  what  is  called  “prefix  notation”.  This  means  we  put  the 
predicate  before  names.  So,  if  we  had  the  sentences 

Tom  is  tall. 

Tom  is  taller  than  Steve. 

And  we  had  the  following  translation  key, 

Fx:  x  is  tall 

Gxy:  x  is  taller  than  y 
a:  Tom 


b:  Steve 


11.  Names  and  predicates  125 


Then  our  translations  would  be 
Fa 
Gab 

I  did  something  new  in  the  translation  key:  I  used  variables  to  identify  places  in  a  predicate. 
This  is  not  any  part  of  our  language,  but  just  a  handy  bit  of  bookkeeping  we  can  use  in 
explaining  our  predicates.  The  advantage  is  that  if  we  write  simply: 

G:  is  greater  than 

there  could  be  ambiguity  about  which  name  should  come  first  after  the  predicate  (the 
greater  than  name,  or  the  less  than  name).  We  avoid  this  ambiguity  by  putting  variables 
into  the  predicate  and  the  English  in  the  translation  key.  But  the  variables  are  doing  no 
other  work.  Don’t  think  of  a  predicate  as  containing  variables. 

The  sentence  above  that  we  had 

Stefano  is  Italian  and  Margarita  is  Italian  and  Aletheia  is  Italian  and  Lorena  is 
Italian. 

can  be  translated  with  the  following  key: 
lx:  x  is  Italian, 
c:  Stefano 
d:  Margarita 
e:  Aletheia 
f:  Lorena 

And  in  our  language  would  look  like  this: 

((lcAId)A(leAI/)) 

We  have  not  yet  given  a  formal  syntax  for  atomic  sentences  of  first  order  logic.  We  will 
need  a  new  concept  of  syntax — the  well  formed  formula  that  is  not  a  sentence — and  for  this 
reason  we  will  put  oft  the  specification  of  the  syntax  for  the  next  chapter. 

11.5  Problems 

1.  Translate  the  following  sentences  into  our  first  order  logic.  Provide  a  translation  key 
that  identifies  the  names  and  predicates. 

a.  Bob  is  a  poriferan. 

b.  Bob  is  neither  a  cnidarian  nor  female. 
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c.  Bob  is  a  male  poriferan. 

d.  Bob  is  not  a  male  poriferan. 

e.  Bob  is  a  poriferan  if  and  only  if  he  is  not  a  cnidarian. 

f.  Pat  is  not  both  a  poriferan  and  a  cnidarian. 

g.  Pat  is  not  a  poriferan,  though  he  is  male. 

h.  Pat  and  Bob  are  male. 

i.  Bob  is  older  than  Pat. 

j.  Pat  is  not  older  than  both  Sandi  and  Bob. 

2.  Identify  the  predicate  of  the  following  sentences,  and  identify  its  arity. 

a.  Aletheia  and  Lorena  are  tall. 

b.  Aletheia  and  Lorena  are  taller  than  Stefano  and  Margarita. 

c.  Margarita  is  younger  than  Aletheia,  Lorena,  and  Stefano. 

d.  Margarita  and  Stefano  live  in  Rome  and  Aletheia  and  Lorena  live  in  Milan. 

e.  Lorena  stands  between  Stefano  and  Aletheia. 

3.  Make  your  own  translation  key  for  the  following  sentences.  Use  your  key  to  write  the 
English  equivalents. 

a.  Fa. 

b.  Gab. 

c.  ( Gab  A  F b). 

d.  (Gab  A  Gac). 

e.  (FavFfi). 


12.  "All"  and  "some" 

1 2.1  The  challenge  of  translating  "all"  and  "some" 

We  are  still  not  able  to  translate  fully  Aristotle’s  argument.  It  began: 

All  men  are  mortal. 

What  does  this  “all”  mean? 

Let’s  start  with  a  simpler  example.  Suppose  for  a  moment  we  consider  the  sentence 
All  is  mortal. 

Or,  equivalently, 

Everything  is  mortal. 

How  should  we  understand  this  “all”  or  “everything”?  This  is  a  puzzle  that  stumped  many 
generations  of  logicians.  The  reason  is  that,  at  first,  it  seems  obvious  how  to  handle  this 
case.  “All”,  one  might  conclude,  is  a  special  name.  It  is  a  name  for  everything  in  my 
domain  of  discourse.  We  could  then  introduce  a  special  name  for  this,  with  the  following 
translation  key. 

e:  all  (or  everything) 

Mx:  x  is  mortal 

And,  so  we  translate  the  sentence 

Me 

So  far,  so  good.  But  now,  what  about  our  first  sentence?  Let’s  add  to  our  translation  key 
Hx:  x  is  human 

Now  how  shall  we  translate  “all  men  are  mortal”?  Most  philosophers  think  this  should  be 
captured  with  a  conditional  (we  will  see  why  below),  but  look  at  this  sentence: 

(He^Me) 

That  does  not  at  all  capture  what  we  meant  to  say.  That  sentence  says:  if  everything  is  a 
human,  then  everything  is  mortal.  We  want  to  say  just  that  all  the  humans  are  mortal. 

Using  a  dilferent  connective  will  not  help. 

(He  A  Me) 
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(He  v  Me) 

(He  ~  Me) 

All  of  these  fail  to  say  what  we  want  to  say.  The  first  says  everything  is  human  and 
everything  is  mortal.  The  second,  that  everything  is  human  or  everything  is  mortal.  The 
third  that  everything  is  human  if  and  only  if  everything  is  mortal. 

The  problem  is  even  worse  for  another  word  that  seems  quite  similar  in  its  use  to  “all”:  the 
word  “some”.  This  sentence  is  surely  true: 

Some  men  are  mortal. 

Suppose  we  treat  “some”  as  a  name,  since  it  also  appears  to  act  like  one.  We  might  have  a 
key  like  this: 

a  :  some 

And  suppose,  for  a  moment,  that  this  meant,  at  least  one  thing  in  our  domain  of  discourse.  And 
then  translate  our  example  sentence,  at  least  as  a  first  attempt,  as 

(Ha  A  Ma) 

This  says  that  some  things  are  human,  and  some  things  are  mortal.  It  might  seem  at  first  to 
work.  But  now  consider  a  different  sentence. 

Some  things  are  human  and  some  things  are  crabs. 

That’s  true.  Let  us  introduce  the  predicate  Kx  for  x  is  a  crab.  Then,  it  would  seem  we 
should  translate  this 

(Ha  A  Ka) 

But  that  does  not  work.  For  a,  if  it  is  a  name,  must  refer  to  the  same  thing.  But,  then 
something  is  both  a  human  and  a  crab,  which  is  false. 

“All”  and  “some”  are  actually  subtle.  They  look  and  (in  some  ways)  act  like  names,  but  they 
are  different  than  names.  So,  we  should  not  treat  them  as  names. 

e:  all  (or  everything) 

a  :  some 

This  perplexed  many  philosophers  and  mathematicians,  but  finally  a  very  deep  thinker 
whom  we  have  already  mentioned — Gottlob  Frege — got  clear  about  what  is  happening 
here,  and  developed  what  we  today  call  the  “quantifier”. 

The  insight  needed  for  the  quantifier  is  that  we  need  to  treat  “all”  and  “some”  as  special 
operators  that  can  “bind”  or  “reach  into”  potentially  several  of  the  arity  places  in  one  or 
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more  predicates.  To  see  the  idea,  consider  first  the  simplest  case.  We  introduce  the  symbol 
V  for  all.  However,  we  also  introduce  a  variable — in  this  case  we  will  use  x — to  be  a 
special  kind  of  place  holder.  (Or:  you  could  think  of  V  as  meaning  every  and  x  as  meaning 
thing,  and  then  Vx  means  everything .)  Now,  to  say  “everything  is  human”,  we  would  write 

VxHx 

Think  of  this  sentence  as  saying,  you  can  take  any  object  from  our  domain  of  discourse,  and  that 
thing  has  property  H.  In  other  words,  if  VxHx  is  true,  then  Ha  is  true,  and  H b  is  true,  and 
He  is  true,  and  so  on,  for  all  the  objects  of  our  domain  of  discourse. 

So  far,  this  is  not  much  different  than  using  a  single  name  to  mean  “everything”.  But  there 
is  a  very  significant  difference  when  we  consider  a  more  complex  formula.  Consider  “All 
men  are  mortal”.  Most  logicians  believe  this  means  that  “Everything  is  such  that,  if  it  is 
human,  then  it  is  mortal”.  We  can  write 

Vx(Hx^Mx) 

So,  if  Vx(Hx^Mx)  is  true,  then  (Ha^Ma)  and  (Hb~>Mb)  and  (Hc^Mc)  and  so  on  are  true. 

This  captures  exactly  what  we  want.  We  did  not  want  to  say  if  everything  is  human,  then 
everything  is  mortal.  We  wanted  to  say,  for  each  thing,  if  it  is  human,  then  it  is  mortal. 

A  similar  approach  will  work  for  “some”.  Let  “3”  be  our  symbol  for  “some”.  Then  we  can 
translate 


Some  men  are  mortal 


With 


3x(HxAMx) 

(We  will  discuss  in  section  13.3  below  why  we  do  not  use  a  conditional  here;  at  this  point, 
we  just  want  to  focus  on  the  meaning  of  the  “3”.)  Read  this  as  saying,  for  this  example, 
there  is  at  least  one  thing  from  our  domain  of  discourse  that  has  properties  H  and  M.  In  other 
words,  either  (Ha  A  Ma)  is  true  or  (H  b  A  Mb)  is  true  or  (He  A  Me)  is  true  or  etc. 

These  new  elements  to  our  language  are  called  “quantifiers”.  The  symbol  “V”  is  called  the 
“universal  quantifier”.  The  symbol  “3”  is  called  the  “existential  quantifier”  (to  remember 
this,  think  of  it  as  saying,  “there  exists  at  least  one  thing  such  that...”).  We  say  that  they 
“quantify  over”  the  things  that  our  language  is  about  (that  is,  the  things  in  our  domain  of 
discourse). 

We  are  now  ready  to  provide  the  syntax  for  terms,  predicates,  and  quantifiers. 

12.2  A  new  syntax 

For  the  propositional  logic,  our  syntax  was  always  trivial.  For  the  first  order  logic  our 
syntax  will  be  more  complex.  We  will  need  a  new  concept,  the  concept  of  a  “well-formed 
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formula”.  And  we  will  need  to  make  more  explicit  use  of  the  fact  that  our  syntax  is  a 
recursive  syntax,  which  means  that  our  rules  must  be  stated  with  a  first  case,  and  then  a 
way  to  repeatedly  apply  our  syntactic  rules.  We  are,  also,  going  to  change  one  feature  of 
our  metalanguage.  The  symbol  $  will  no  longer  mean  a  sentence.  Instead,  it  is  any  well- 
formed  expression  of  our  language.  We  can  write  <&(a)  to  mean  that  the  name  a  appears 
in  $>;  this  does  not  mean  that  $  is  an  arity-one  predicate  with  the  single  name  a.  $  can  be 
very  complex.  For  example,  $  could  be  the  expression  ((Fa^Gbc)*Hd). 

A  symbolic  term  is  either  a  name,  an  indefinite  name,  an  arbitrary  term,  or  a  variable 
(we  will  explain  what  indefinite  terms  and  arbitrary  terms  are  later).  Names  are  a,  b, 
c,  d....  Indefinite  names  are  p,  q,  r. ...  Variables  are  u ,  v,  w,  x,  y,  z.  Arbitrary  terms  are 
u',  v\  u/,  x\  y’,  z'. 

A  predicate  of  arity  n  followed  by  n  symbolic  terms  is  a  well-formed  formula. 

If  $  and  W  are  well-formed  formulas,  and  a  is  a  variable,  then  the  following  are  well- 
formed  formulas: 

($-*vp) 

($  AVp) 

($  v'F) 

VaS> 

3a$ 

If  the  expression  $>(a)  contains  no  quantifiers,  and  a  is  a  variable,  then  we  say  that  a  is  a 
“free  variable”  in  5>(a).  If  the  expression  $(a)  contains  no  quantifiers,  and  a  is  a  variable, 
then  we  say  that  a  is  “bound”  in  Va$(a)  and  a  is  “bound”  in  3a$(a).  A  variable  that  is 
bound  is  not  free. 

If$  is  a  well-formed  formula  with  no  free  variables,  then  it  is  a  sentence. 

If  3>  and  VF  are  sentences,  then  the  following  are  sentences: 


($  Aq/) 

(d>  vW) 
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This  way  of  expressing  ourselves  is  precise;  but,  for  some  of  us,  when  seeing  it  for  the  first 
time,  it  is  hard  to  follow.  Let’s  take  it  a  step  at  a  time.  Let’s  suppose  that  F  is  a  predicate  of 
arity  one,  that  G  is  a  predicate  of  arity  two,  and  that  H  is  a  predicate  of  arity  three.  Then 
the  following  are  all  well-formed  formulas. 

Fx 

Fy 

Fa 

Gxy 

Gyx 

Gab 

Gax 

H  xyz 

H  axe 

H  czy 

And,  if  we  combine  these  with  connectives,  they  form  well-formed  formulas.  All  of  these 
are  well-formed  formulas: 

-■Fx 

(Fx-^Fy) 

(Fa*  Gxy) 

(Gyx  v  Gab ) 

(Gax^Hxyz) 

VxH  axe 
3zHczy 

For  these  formulas,  we  say  that  x  is  a  free  variable  in  each  of  the  first  five  well-formed 
formulas.  The  variable  x  is  bound  in  the  sixth  well-formed  formula.  The  variable  z  is 
bound  in  the  last  well-formed  formula,  but  y  is  free  in  that  formula. 
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For  the  following  formulae,  there  are  no  free  variables. 

VxFx 

3zGza 
Fa 
G  be 

Each  of  these  four  well-formed  formulas  is,  therefore,  a  sentence.  If  combined  using  our 
connectives,  these  would  make  additional  sentences.  For  example,  these  are  all  sentences: 

-A/xFx 

(VxFx^SzGza) 

{Fa  A  G be) 

(G be  3zGza) 

(G be  v  3zGza) 

The  basic  idea  is  that  in  addition  to  sentences,  we  recognize  formulae  that  have  the  right 
shape  to  be  a  sentence,  if  only  they  had  names  instead  of  variables  in  certain  places  in 
the  formula.  These  then  become  sentences  when  combined  with  a  quantifier  binding  that 
variable,  because  now  the  variable  is  no  longer  a  meaningless  placeholder,  and  instead  stands 
for  any  or  some  object  in  our  language. 

What  about  the  semantics  for  the  quantifiers?  This  will,  unfortunately,  have  to  remain 
intuitive  during  our  development  of  first  order  logic.  We  need  set  theory  to  develop  a 
semantics  for  the  quantifiers;  truth  tables  will  not  work.  In  chapter  17,  you  can  read  a 
little  about  how  to  construct  a  proper  semantics  for  the  quantifiers.  Here,  let  us  simply 
understand  the  universal  quantifier,  “V”,  as  meaning  every  object  in  our  domain  of  discourse; 
and  understand  the  existential  quantifier,  “3”,  as  meaning  at  least  one  object  in  our  domain  of 
discourse. 

A  note  about  the  existential  quantifier.  “Some”  in  English  does  not  often  mean  at  least  one. 
If  you  ask  your  friend  for  some  of  her  french  fries,  and  she  gives  you  exactly  one,  you  will 
feel  cheated.  However,  we  will  likely  agree  that  there  is  no  clear  norm  for  the  number 
of  french  fries  that  she  must  give  you,  in  order  to  satisfy  your  request.  In  short,  the  word 
“some”  is  vague  in  English.  This  is  a  useful  vagueness — we  don’t  want  to  have  to  say  things 
like,  “Give  me  11  french  fries,  please”.  But,  our  logical  language  must  be  precise,  and  so,  it 
must  have  no  vagueness.  For  this  reason,  we  interpret  the  existential  quantifier  to  mean  at 
least  one. 


1 2.3  Common  sentence  forms  for  quantifiers 

Formulas  using  quantifiers  can  have  very  complex  meanings.  However,  translating  from 
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English  to  first  order  logic  expressions  is  usually  surprisingly  easy,  because  in  English  many 
of  our  phrases  using  “all”  or  “some”  or  similar  phrases  are  of  eight  basic  forms.  Once  we 
memorize  those  forms,  we  can  translate  these  kinds  of  phrases  from  English  into  logic. 

Here  are  examples  of  the  eight  forms,  using  some  hypothetical  sentences. 

Everything  is  human. 

Something  is  human. 

Something  is  not  human. 

Nothing  is  human. 

All  humans  are  mortal. 

Some  humans  are  mortal. 

Some  humans  are  not  mortal. 

No  humans  are  mortal. 

Our  goal  is  to  decide  how  best  to  translate  each  of  these.  Then,  we  will  generalize.  Let  us 
use  our  key  above,  in  which  “Hx”  means  x  is  human,  and  “Mx”  means  x  is  mortal. 

The  first  two  sentences  are  straightforward.  The  following  are  translations. 

VxHx 

3xHx 

What  about  the  third  sentence?  It  is  saying  there  is  something,  and  that  thing  is  not  human. 
A  best  translation  of  that  would  be  to  start  with  the  “something”. 

3xnHx 

That  captures  what  we  want.  At  least  one  thing  is  not  human.  Contrast  this  with  the  next 
sentence.  We  can  understand  it  as  saying,  It  is  not  the  case  that  something  is  human.  That  is 
translated: 


i3xHx 

(It  turns  out  that  “3x-iHx”  and  “-iVxHx”  are  equivalent  and  “-i3xHx”  and  “Vx-iHx”  are 
equivalent:  so  we  could  also  translate  “Something  is  not  human”  with  “-A/xHx”,  and 
“Nothing  is  human”  with  “Vx-iHx”.  However,  this  author  finds  these  less  close  to  the 
English  in  syntactic  form.) 

The  next  four  are  more  subtle.  “All  humans  are  mortal”  seems  to  be  saying,  if  anything  is 
human,  then  that  thing  is  mortal.  That  tells  us  directly  how  to  translate  the  expression: 
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Vx(Hx^Mx) 

What  about  “some  humans  are  mortal”?  This  is  properly  translated  with: 

3x(HxAMx) 

Many  students  suspect  there  is  some  deep  similarity  between  “all  humans  are  mortal” 
and  “some  humans  are  mortal”,  and  so  want  to  translate  “some  humans  are  mortal”  as 
3x(Hx-^Mx).  This  would  be  a  mistake.  Remember  the  truth  table  for  the  conditional;  if 
the  antecedent  is  false,  then  the  conditional  is  true.  Thus,  the  formula  Bx(Hx^Mx)  would 
be  true  if  there  were  no  humans,  and  it  would  be  true  if  there  were  no  humans  and  no 
mortals. 

That  might  seem  a  bit  abstract,  so  let’s  leave  oft  our  language  about  humans  and  mortality, 
and  consider  a  different  first  order  logic  language,  this  one  about  numbers.  Our  domain  of 
discourse,  let  us  suppose,  is  the  natural  numbers  (l,  2,  3,  . . .).  Let  “Fx”  mean  “x  is  even”  and 
“Gx”  mean  “x  is  odd”.  Now  consider  the  following  formula: 

Some  even  number  is  odd. 

We  can  agree  that,  for  the  usual  interpretation  of  “odd”  and  “even”,  this  sentence  is  false. 
But  now  suppose  we  translated  it  as 

3x(Fx^Gx) 

This  sentence  is  true.  That’s  because  there  is  at  least  one  object  in  our  domain  of  discourse 
for  which  it  is  true.  For  example,  consider  the  number  3  (or  any  odd  number).  Suppose 
that  in  our  logical  language,  a  means  3.  Then,  the  following  sentence  is  true: 

(Fa-^Ga) 

This  sentence  is  true  because  the  antecedent  is  false,  and  the  consequent  is  true.  That  makes 
the  whole  conditional  true. 

Clearly,  “3x(Fx^Gx)”  cannot  be  a  good  translation  of  “Some  even  number  is  odd”,  because 
whereas  “Some  even  number  is  odd”  is  false,  “3x(Fx-^Gx)”  is  true.  The  better  translation  is 

3x(FxAGx) 

This  says,  some  number  is  both  even  and  odd.  That’s  clearly  false,  matching  the  truth  value 
of  the  English  expression. 

To  return  to  our  language  about  humans  and  mortality.  The  sentence  “some  human  is 
mortal”  should  be  translated 

3x(HxAMx) 

And  this  makes  clear  how  we  can  translate,  “some  human  is  not  mortal”: 
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3x(Hx  a  -iMx) 

The  last  sentence,  “No  humans  are  mortal”  is  similar  to  “Nothing  is  human”.  We  can  read 
it  as  meaning  It  is  not  the  case  that  some  humans  are  mortal,  which  we  can  translate: 

i3x(HxAMx) 

(It  turns  out  that  this  sentence  is  equivalent  to,  “all  humans  are  not  mortal”.  Thus,  we  could 
also  translate  the  sentence  with: 

Vx(Hx^nMx).) 

We  need  to  generalize  these  eight  forms.  Let  $>  and  VF  be  expressions  (these  can  be 
complex).  Let  a  be  any  variable.  Then,  we  can  give  the  eight  forms  schematically  in  the 
following  way. 

Everything  is  $ 

Va5>(a) 

Something  is  $ 

3a$(a) 

Something  is  not  $ 

3a-i$(a) 

Nothing  is  $ 

-i3a$(a) 

All  $  are  W 
Va($(a)->W(a)) 

Some  3>  are  W 

3a  ($(a)  A  'T(a)) 

Some  $  are  not  W 
3a  ($(a)  A  -ivE(a)) 

No  are  W 
i3a  ($(a)A  'T(a)) 


These  eight  forms  include  the  most  common  forms  of  sentences  that  we  encounter  in 
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English  that  use  quantifiers.  This  may  not,  at  first,  seem  plausible,  but,  when  we  recognize 
that  these  generalized  forms  allow  that  the  expression  <I>  or  h7  can  be  complex,  then,  we  see 
that  the  following  are  examples  of  the  eight  forms,  given  in  the  same  order: 

Everything  is  a  female  human  from  Texas. 

Something  is  a  male  human  from  Texas. 

Something  is  not  a  female  human  computer  scientist  from  Texas. 

Nothing  is  a  male  computer  scientist  from  Texas. 

All  male  humans  are  mortal  mammals. 

Some  female  humans  are  computer  scientists  who  live  in  Texas. 

Some  female  humans  are  not  computer  scientists  who  live  in  Texas. 

No  male  human  is  a  computer  scientist  who  lives  in  Texas. 

The  task  in  translating  such  sentences  is  to  see,  when  we  refer  back  to  our  schemas,  that 
$  and  h7  can  be  complex.  Thus,  if  we  add  to  our  key  the  following  predicates: 

Fx:  x  is  female 

Gx:  x  is  male 

Tx:  x  is  from  Texas 

Sx:  x  is  a  computer  scientist 

Lx:  x  is  a  mammal 

Then,  we  can  see  that  the  following  are  translations  of  the  eight  English  sentences,  and  they 
utilize  the  eight  forms. 

Vx((FxAHx)  A  Tx) 

3x((GxAHx)ATx) 

3x  i((FxAHx)A(SxATx)) 

i3x((GxASx)ATx) 

Vx((GxAHx)  -»  (MxALx)) 

3x((FxaHx)  a  (SxaTx)) 

3x((FxaHx)  a  i(SxaTx)) 


12.  “All”  and  “some”  137 


-i3x((GxAHx)A(SxATx)) 

Another  important  issue  to  be  aware  of  when  translating  expressions  with  quantifiers  is  that 
“only”  plays  a  special  role  in  some  English  expressions.  Consider  the  following  sentences. 

All  sharks  are  fish. 

Only  sharks  are  fish. 

The  first  of  these  is  true;  the  second  is  false.  We  will  start  a  new  logical  language  and  key. 
Let  Fx  mean  that  x  is  a  fish,  and  Sx  mean  that  x  is  a  shark.  We  know  how  to  translate  the 
first  sentence. 

Vx(Sx  -»  Fx) 

However,  how  shall  we  translate  “Only  sharks  are  fish”?  This  sentence  tells  us  that  the  only 
things  that  are  fish  are  the  sharks.  But  then,  all  fish  are  sharks.  That  is,  the  translation  is: 

Vx(Fx  -»  Sx) 

It  would  also  be  possible  to  combine  these  claims: 

All  and  only  sharks  are  fish. 

Which  should  be  translated: 

Vx(Sx  Fx) 

This  indicates  two  additional  schemas  for  translation  that  may  be  useful.  First,  sentences  of 
the  form  “Only  5>  are  VF”  should  be  translated: 

Va(W(a)  -  $(a)) 

Second,  sentences  of  the  form  “all  and  only  are  W”  should  be  translated  in  the  following 
way: 


Va($(a)  lF(a)) 


12.4  Problems 

1.  Which  of  the  following  expressions  has  a  free  variable?  Identify  the  free  variable  if 
there  is  one.  Assume  F  is  an  arity  one  predicate,  and  G  is  an  arity  two  predicate. 

a.  Fa 

b.  Fx 
C.  Gx<3 

d.  3xFx 

e.  VxG  xa 
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f.  VxG  xy 

g.  Vx(F  x  -»  G  xa) 

h.  (VxFx  -»  G xa) 

i.  (VxFx  -»  VxG  xa) 

j.  Vx(F  x  -»  Gxy) 

2.  Provide  a  key  and  translate  the  following  expressions  into  first  order  logic.  Assume  the 
domain  of  discourse  is  Terrestrial  organisms.  Thus,  VxFx  would  mean  all  Terrestrial 
organisms  are  F,  and  3xFx  would  mean  at  least  one  Terrestrial  organisms  is  F.  Don’t  be 
concerned  that  some  of  these  sentences  are  obviously  false. 

a.  All  horses  are  mammals. 

b.  Some  horses  are  mammals. 

c.  No  horses  are  mammals. 

d.  Some  horses  are  not  mammals. 

e.  Some  mammals  lay  eggs,  and  some  mammals  do  not. 

f.  Some  chestnut  horses  are  mammals  that  don’t  lay  eggs. 

g.  No  chestnut  horses  are  mammals  that  lay  eggs. 

h.  Some  egg-laying  mammals  are  not  horses. 

i.  There  are  no  horses. 

j.  There  are  some  mammals. 

k.  Only  horses  are  mammals. 

l.  All  and  only  horses  are  mammals. 

3.  Provide  your  own  key  and  translate  the  following  expressions  of  first  order  logic  into 
natural  sounding  English  sentences.  All  the  predicates  here  are  meant  to  be  arity  one. 
Do  not  worry  if  some  of  your  sentences  are  obviously  false;  you  rather  want  to  show 
you  can  translate  from  logic  to  normal  sounding  English. 

a.  Vx((FxAGx)  ->Hx) 

b.  Vx(Fx^iHx) 

c.  3x((FxA(GxAHx)) 

d.  3x((FxAi(GxAHx)) 

e.  ->3x(FxAGx) 


13.  Reasoning  with  quantifiers 

13.1  Using  the  universal  quantifier 

How  shall  we  construct  valid  arguments  using  the  existential  and  the  universal  quantifier? 
The  semantics  for  the  quantifiers  must  remain  intuitive.  However,  they  are  sufficiently 
clear  for  us  to  introduce  some  rules  that  will  obviously  preserve  validity.  In  this  chapter,  we 
will  review  three  inference  rules,  ordering  them  from  the  easiest  to  understand  to  the  more 
complex. 

The  easiest  case  to  begin  with  is  the  universal  quantifier.  Recall  Aristotle’s  argument: 

All  men  are  mortal. 

Socrates  is  a  man. 


Socrates  is  mortal. 

We  now  have  the  tools  to  represent  this  argument. 
Vx(Hx->Mx) 

Ha 


Ma 

But,  how  can  we  show  that  this  argument  is  valid? 

The  important  insight  here  concerns  the  universal  quantifier.  We  understand  the  first 
sentence  as  meaning,  for  any  object  in  my  domain  of  discourse,  if  that  object  is  human,  then 
that  object  is  mortal.  That  means  we  could  remove  the  quantifier,  put  any  name  in  our 
language  into  the  free  x  slots  in  the  resulting  formula,  and  we  would  have  a  true  sentence: 
(Ha^Ma)  and  (Hb^Mb)  and  (Hc^Mc)  and  (Hffi^Md)  and  so  on  would  all  be  true. 

We  need  only  make  this  semantic  concept  into  a  rule.  We  will  call  this,  “universal 
instantiation”.  To  remember  this  rule,  just  remember  that  it  is  taking  us  from  a  general  and 
universal  claim,  to  a  specific  instance.  That’s  what  we  mean  by  “instantiation”.  We  write 
the  rule,  using  our  metalanguage,  in  the  following  way.  Let  a  be  any  variable,  and  let  be 
any  symbolic  term. 

Va$>(a) 
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*(P) 

This  is  a  very  easy  rule  to  understand.  One  removes  the  quantifier,  and  replaces  every 
free  instance  of  the  formerly  bound  variable  with  a  single  symbolic  term  (this  is  important: 
the  instance  that  replaces  your  variable  must  be  the  same  symbolic  term  throughout — you 
cannot  instantiate  Vx(Hx^Mx)  to  for  example). 

With  this  rule,  we  can  finally  prove  Aristotle’s  argument  is  valid. 


1.  Vx(Hx  — »  Mx) 

2.  Ha 

3.  (Ha  — >  Ma) 

4.  Ma 


premise 

premise 

universal  instantiation,  1 
modus  ponens,  3,  2 


13.2  Showing  the  existential  quantifier 

Consider  the  following  argument. 

All  men  are  mortal. 


Socrates  is  a  man. 


Something  is  mortal. 

This  looks  to  be  an  obviously  valid  argument,  a  slight  variation  on  Aristotle’s  original 
syllogism.  Consider:  if  the  original  argument,  with  the  same  two  premises,  was  valid,  then 
the  conclusion  that  Socrates  is  mortal  must  be  true  if  the  premises  are  true.  But,  if  it  must  be 
true  that  Socrates  is  mortal,  then  it  must  be  true  that  something  is  mortal.  Namely,  at  least 
Socrates  is  mortal  (recall  that  we  interpret  the  existential  quantifier  to  mean  at  least  one). 

We  can  capture  this  reasoning  with  a  rule.  If  a  particular  object  has  a  property,  then, 
something  has  that  property.  Written  in  our  meta-language,  where  |3  is  some  symbolic 
term  and  a  is  a  variable: 

*(P) 


3a<£(a) 


This  rule  is  called  “existential  generalization”.  It  takes  an  instance  and  then  generalizes  to  a 
general  claim. 

We  can  now  show  that  the  variation  on  Aristotle’s  argument  is  valid. 
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1.  Vx(Hx  — >•  Mx) 

2.  Ha 

3.  (Ha  — >  Ma) 

4.  Ma 

5.  3xMx 


premise 

premise 

universal  instantiation,  1 
modus  ponens,  3,  2 
existential  generalization,  4 


13.3  Using  the  existential  quantifier 

Consider  one  more  variation  of  Aristotle’s  argument. 

All  men  are  mortal. 


Something  is  a  man. 


Something  is  mortal. 

This,  too,  looks  like  it  must  be  a  valid  argument.  If  the  first  premise  is  true,  then  any  human 
being  you  could  find  would  be  mortal.  And,  the  second  premise  tells  us  that  something  is 
a  human  being.  So,  this  something  must  be  mortal. 

But,  this  argument  confronts  us  with  a  special  problem.  The  argument  does  not  tell  us 
which  thing  is  a  human  being.  This  might  seem  trivial,  but  it  really  is  only  trivial  in 
our  example  (because  you  know  that  there  are  many  human  beings).  In  mathematics,  for 
example,  there  are  many  very  surprising  and  important  proofs  that  some  number  with  some 
strange  property  exists,  but  no  one  has  been  able  to  show  specifically  which  number.  So, 
it  can  happen  that  we  know  that  there  is  something  with  a  property,  but,  not  know  what 
thing. 

Logicians  have  a  solution  to  this  problem.  We  will  introduce  a  special  kind  of  name,  which 
refers  to  something,  but  we  know  not  what.  Call  this  an  “indefinite  name”.  We  will  use 
p,  q,  r...  as  these  special  names  (we  know  these  are  not  atomic  sentences  because  they  are 
lowercase).  Then,  where  y  is  some  indefinite  name  and  a  is  a  variable,  our  rule  is: 

3a4>(a) 


O(x) 

where  y  is  an  indefinite  name  that  does  not  appear  above  in  an  open  proof 

This  rule  is  called  “existential  instantiation”.  By  “open  proof”  we  mean  a  subproof  that  is 
not  yet  complete. 


The  last  clause  is  important.  It  requires  us  to  introduce  indefinite  names  that  are  new.  If 
an  indefinite  name  is  already  being  used  in  your  proof,  then  you  must  use  a  new  indefinite 
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name  if  you  do  existential  instantiation.  This  rule  is  a  little  bit  stronger  than  is  required 
in  all  cases,  but  it  is  by  far  the  easiest  way  to  avoid  a  kind  of  mistake  that  would  produce 
invalid  arguments.  To  see  why  this  is  so,  let  us  drop  the  clause  for  the  sake  of  an  example. 
In  this  example,  we  will  prove  that  the  Pope  is  the  President  of  the  United  States.  We  need 
only  the  following  key. 

Hx:  x  is  the  President  of  the  United  States. 


Jx:  x  is  the  Pope. 

Here  are  two  very  plausible  premises,  which  I  believe  that  you  will  grant:  there  is  a 
President  of  the  United  States,  and  there  is  a  Pope.  So,  here  is  our  proof: 


1.  3xHx 

2.  3xJx 


premise 

premise 


3.  H p 

4.  J p 

5.  (H p  A  J p) 

6.  3x{Y\x  A  Jx) 


existential  instantiation,  1 
existential  instantiation,  2 
adjunction,  3,  4 
existential  generalization,  5 


Thus,  we  have  just  proved  that  there  is  a  President  of  the  United  States  who  is  Pope. 

But  that’s  false.  We  got  a  false  conclusion  from  true  premises — that  is,  we  constructed 
an  invalid  argument.  What  went  wrong?  We  ignored  the  clause  on  our  existential 
instantiation  rule  that  requires  that  the  indefinite  name  used  when  we  apply  the  existential 
instantiation  rule  cannot  already  be  in  use  in  the  proof.  In  line  4,  we  used  the  indefinite 
name  “p”  when  it  was  already  in  use  in  line  3. 

Instead,  if  we  had  followed  the  rule,  we  would  have  a  very  different  proof: 


1.  3xHx 

2.  3x]x 


premise 

premise 


3.  H p 

4.  J  q 

5.  (H p  A  Jg) 


existential  instantiation,  1 
existential  instantiation,  2 
adjunction,  3,  4 


Because  we  cannot  assume  that  the  two  unknowns  are  the  same  thing,  we  give  them 
each  a  temporary  name  that  is  different.  Since  existential  generalization  replaces  only 
one  symbolic  term,  from  line  five  you  can  only  generalize  to  3x(Hx  A  j«)  or  to  3x(Hp  A 
Jx) — or,  if  we  performed  existential  generalization  twice,  to  something  like  3x3y(Hx  A  Jr)- 
Each  of  these  three  sentences  would  be  true  if  the  Pope  and  the  President  were  different 
things,  which  in  fact  they  are. 

We  can  now  prove  that  the  variation  on  Aristotle’s  argument,  given  above,  is  valid. 
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1.  Vx(Hx  — >•  Mx) 

2.  3xHx 

3.  Hp 

4.  (Hp  — >  Mp) 

5.  Mp 

6.  3xMx 


premise 

premise 

existential  instantiation,  2 
universal  instantiation,  1 
modus  ponens,  4,  3 
existential  generalization,  5 


A  few  features  of  this  proof  are  noteworthy.  We  did  existential  instantiation  first,  in  order 
to  obey  the  rule  that  our  temporary  name  is  new:  “p”  does  not  appear  in  any  line  in  the 
proof  before  line  3.  But,  then,  we  are  permitted  to  do  universal  instantiation  to  “p”,  as  we 
did  on  line  4.  A  universal  claim  is  true  of  every  object  in  our  domain  of  discourse,  including 
the  I-know-not-what. 


We  can  consider  an  example  that  uses  all  three  of  these  rules  for  quantifiers.  Consider  the 
following  argument. 

All  whales  are  mammals.  Some  whales  are  carnivorous.  All  carnivorous 
organisms  eat  other  animals.  Therefore,  some  mammals  eat  other  animals. 

We  could  use  the  following  key. 

Fx:  x  is  a  whale. 


Gx:  x  is  a  mammal. 


Hx:  x  is  carnivorous. 

Ix:  x  eats  other  animals. 
Which  would  give  us: 

Vx(Fx^>Gx) 

3x(FxAHx) 

Vx(Hx^Ix) 


3x(GxAIx) 


Here  is  one  proof  that  the  argument  is  valid. 
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1.  Vx(Fx  — *  Gx) 

premise 

2.  3x(Fx  A  Hx) 

premise 

3.  Vx(Hx  — »  lx) 

premise 

4.  (F p  A  H p) 

existential  instantiation,  2 

5.  F p 

simplification,  4 

6.  (F p  — >  G p) 

universal  instantiation,  1 

7.  Gp 

modus  ponens,  6,  5 

8.  H p 

simplification,  4 

9.  (H p  — >  1  p) 

universal  instantiation,  3 

10.  1  p 

modus  ponens,  9,  8 

11.  (H p  A  1  p) 

adjunction,  8,  10 

12.  3x(Gx  A  lx) 

existential  generalization,  11 

13.4  Problems 

1.  Prove  the  following  arguments  are  valid.  Note  that,  in  addition  to  the  new  rules  for 
reasoning  with  quantifiers,  you  will  still  have  to  use  techniques  like  conditional 
derivation  (when  proving  a  conditional)  and  indirect  derivation  (when  proving 
something  that  is  not  a  conditional,  and  for  which  you  cannot  find  a  direct  derivation). 
These  wul  require  universal  instantiation. 

a.  Premises:  Vx(Fx  -»  Gx),  Fa,  F b.  Conclusion:  (G a  A  G b). 

b.  Premises:  Vx(Hx  Fx),  iFc.  Conclusion:  -iHc. 

c.  Premises:  Vx(Gx  v  Hx),  -HL  Conclusion:  G b. 

d.  Premises:  Vx(Fx  -»  Gx),  Vx(Gx  -»  Hx).  Conclusion:  (Fa  -»  Ha). 

e.  Premises:  Vx(Gx  v  lx),  Vx(Gx  -» Jx),  Vx(Ix  -»Jx).  Conclusion:  ]b. 

2.  Prove  the  following  arguments  are  valid.  These  will  require  existential  generalization. 

a.  Premises:  Vx(F  x  Gx),  G d.  Conclusion:  3x(Gx  A  Fx). 

b.  Premises:  (Ga  A  Fa),  Vx(Fx  <-»  Hx),  Vx(iGx  v  Jx).  Conclusion:  3x(Hx  A  Jx). 

c.  Premises:  i(Fa  A  Ga).  Conclusion:  3x(-iFx  v  -iGx). 

d.  Conclusion:  3x(Fx  v  -iFx) 

3.  Prove  the  following  arguments  are  valid.  These  will  require  existential  instantiation. 

a.  Premises:  3x-i(Fx  A  Gx).  Conclusion:  3x(-iFx  v  iGx). 

b.  Premises:  3x(Fx  A  Gx),  Vx(iGx  v  Kx),  Vx(Fx  -»  Hx).  Conclusion:  3x(Hx  A  Kx). 

c.  Conclusion:  (Vx(Fx  -»  Gx)  -»  (3xFx  -»  3xGx)) 

d.  Conclusion:  (Vx(Fx  -»  Gx)  -»  (3x-iGx  -»  3x-iFx)) 

4.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises,  and  where  at  least  one  premise  is  an  existential  claim.  Your  argument  should 
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just  be  a  paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like 
formal  logic).  Translate  it  into  first  order  logic  and  prove  it  is  valid. 

5.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises  and  with  a  conclusion  that  is  an  existential  claim.  Your  argument  should  just 
he  a  paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like  formal 
logic).  Translate  it  into  first  order  logic  and  prove  it  is  valid. 

6.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises,  and  where  at  least  one  premise  is  a  universal  claim.  Your  argument  should 
just  be  a  paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like 
formal  logic).  Translate  it  into  first  order  logic  and  prove  it  is  valid. 

7.  Some  philosophers  have  developed  arguments  attempting  to  prove  that  there  is  a  god. 

One  such  argument,  which  was  very  influential  until  Darwin,  is  the  Design 
Argument.  The  Design  Argument  has  various  forms,  with  subtle  differences,  but  here 
is  one  (simplified)  version  of  a  design  argument. 

Anything  with  complex  independently  interrelated  parts  was  designed.  If 
something  is  designed,  then  there  is  an  intelligent  designer.  All  living  organisms 
have  complex  independently  interrelated  parts.  There  are  living  organisms. 
Therefore,  there  is  an  intelligent  designer. 

Symbolize  this  argument,  and  prove  that  it  is  valid.  (The  second  sentence  is  perhaps 
best  symbolized  not  using  one  of  the  eight  forms,  but  rather  using  a  conditional,  where 
both  the  antecedent  and  the  consequent  are  existential  sentences.)  Do  you  believe  this 
argument  is  sound?  Why  do  you  think  Darwin’s  work  was  considered  a  significant 
challenge  to  the  claim  that  the  argument  is  sound? 


14.  Universal  derivation 


14.1  An  example:  the  Meno 

In  one  of  Plato’s  dialogues,  the  Meno,  Socrates  uses  questions  and  prompts  to  direct  a  young 
slave  boy  in  the  process  of  making  a  square  that  has  twice  the  area  of  a  given  square,  by 
using  the  diagonal  of  the  given  square  as  a  side  in  the  new  square.  Socrates  draws  a  square 
1  foot  on  a  side  in  the  dirt.  The  young  boy  at  first  just  suggests  that  to  double  its  area,  the 
two  sides  of  the  square  should  be  doubled,  but  Socrates  shows  him  that  this  would  result 
in  a  square  that  is  four  times  the  area  of  the  given  square;  that  is,  a  square  of  the  size  four 
square  feet.  Next,  Socrates  takes  this  2x2  square,  which  has  four  square  feet,  and  shows  the 
boy  how  to  make  a  square  double  its  size. 

Socrates:  Tell  me,  boy,  is  not  this  a  square  of  four  feet  that  I  have  drawn? 

Boy:  Yes. 

Socrates:  And  now  I  add  another  square  equal  to  the  former  one? 

Boy:  Yes. 

Socrates:  And  a  third,  which  is  equal  to  either  of  them? 

Boy:  Yes. 

Socrates:  Suppose  that  we  fill  up  the  vacant  corner? 

Boy:  Very  good. 

Socrates:  Here,  then,  there  are  four  equal  spaces? 

Boy:  Yes.[12] 

So  what  Socrates  has  drawn  at  this  point  looks  like: 


Suppose  each  square  is  a  foot  on  a  side.  Socrates  will  now  ask  the  boy  how  to  make  a  square 
that  is  of  eight  square  feet,  or  twice  the  size  of  their  initial  2x2  square.  Socrates  has  a  goal 
and  method  in  drawing  the  square  four  times  the  size  of  the  original. 

Socrates:  And  how  many  times  larger  is  this  space  than  the  other? 

Boy:  Four  times. 

Socrates:  But  it  ought  to  have  been  twice  only,  as  you  will  remember. 

Boy:  True. 
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Socrates:  And  does  not  this  line,  reaching  from  comer  to  corner,  bisect  each  of 
these  spaces? 

By  “spaces”,  Socrates  means  each  of  the  2x2  squares.  Socrates  has  now  drawn  the  following: 


Boy:  Yes. 

Socrates:  And  are  there  not  here  four  equal  lines  that  contain  this  new  square? 
Boy:  There  are. 

Socrates:  Look  and  see  how  much  this  new  square  is. 

Boy:  I  do  not  understand. 

After  some  discussion,  Socrates  gets  the  boy  to  see  that  where  the  new  line  cuts  a  small 
square,  it  cuts  it  in  half.  So,  adding  the  whole  small  squares  inside  this  new  square,  and 
adding  the  half  small  squares  inside  this  new  square,  the  boy  is  able  to  answer. 

Socrates:  The  new  square  is  of  how  many  feet? 

Boy:  Of  eight  feet. 

Socrates:  And  from  what  line  do  you  get  this  new  square? 

Boy:  From  this.  [The  boy  presumably  points  at  the  dark  line  in  our  diagram.] 
Socrates:  That  is,  from  the  line  which  extends  from  corner  to  comer  of  the  each 
of  the  spaces  of  four  feet? 

Boy:  Yes. 

Socrates:  And  that  is  the  line  that  the  educated  call  the  “diagonal”.  And  if  this  is 
the  proper  name,  then  you,  Meno’s  slave,  are  prepared  to  affirm  that  the  double 
space  is  the  square  of  the  diagonal? 

Boy:  Certainly,  Socrates. 

For  the  original  square  that  was  2x2  feet,  by  drawing  a  diagonal  of  the  square  we  were  able 
to  draw  one  side  of  a  square  that  is  twice  the  area.  Socrates  has  demonstrated  how  to  make 
a  square  twice  the  area  of  any  given  square:  make  the  new  square’s  sides  each  as  large  as  the 
diagonal  of  the  given  square. 

It  is  curious  that  merely  by  questioning  the  slave  (who  would  have  been  a  child  of  a  Greek 
family  defeated  in  battle,  and  would  have  been  deprived  of  any  education),  Socrates  is  able 
to  get  him  to  complete  a  proof.  Plato  takes  this  as  a  demonstration  of  a  strange  metaphysical 
doctrine  that  each  of  us  once  knew  everything  and  have  forgotten  it,  and  now  we  just 
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need  to  be  helped  to  remember  the  truth.  But  we  should  note  a  different  and  interesting 
fact.  Neither  Socrates  nor  the  slave  boy  ever  doubts  that  Socrates’s  demonstration  is  true 
of  all  squares.  That  is,  while  Socrates  draws  squares  in  the  dirt,  the  slave  boy  never  says, 
“Well,  Socrates,  you’ve  proved  that  to  make  a  square  twice  as  big  as  this  square  that  you 
have  drawn,  I  need  to  take  the  diagonal  of  this  square  as  a  side  of  my  new  square.  But  what 
about  a  square  that’s  much  smaller  or  larger  than  the  one  you  drew  here?” 

That  is  in  fact  a  very  perplexing  question.  Why  is  Socrates’s  demonstration  good  for  all,  for 
any,  squares? 


14.2  A  familiar  strangeness 

We  have  saved  for  last  the  most  subtle  issue  about  reasoning  with  quantifiers:  how  shall  we 
prove  something  is  universally  true? 

Consider  the  following  argument.  We  will  assume  a  first  order  logical  language  that  talks 
about  numbers,  since  it  is  sometimes  easier  to  imagine  something  true  of  everything  in  our 
domain  of  discourse  if  we  are  talking  about  numbers. 

All  numbers  evenly  divisible  by  eight  are  evenly  divisible  by  four. 

All  numbers  evenly  divisible  by  four  are  evenly  divisible  by  two. 

All  numbers  evenly  divisible  by  eight  are  evenly  divisible  by  two. 

Let  us  assume  an  implicit  translation  key,  and  then  we  can  say  that  the  following  is  a 
translation  of  this  argument. 

Vx(Fx— >Gx) 

Vx(Gx^Hx) 


Vx(Fx^Hx) 

This  looks  like  a  valid  argument.  Indeed,  it  may  seem  obvious  that  it  is  valid.  But  to  prove 
it,  we  need  some  way  to  be  able  to  prove  a  universal  statement. 

But  how  could  we  do  such  a  thing?  There  are  infinitely  many  numbers,  so  surely  we 
cannot  check  them  all.  How  do  we  prove  that  something  is  true  of  all  numbers,  without 
taking  an  infinite  amount  of  time  and  creating  an  infinitely  long  proof? 

The  odds  are  that  you  already  know  how  to  do  this,  although  you  have  never  reflected  on 
your  ability.  You  most  likely  saw  a  proof  of  a  universal  claim  far  back  in  grade  school,  and 
without  reflection  concluded  it  was  good  and  proper.  For  example,  when  you  were  first 
taught  that  the  sum  of  the  interior  angles  of  a  triangle  is  equivalent  to  two  right  angles,  you 
might  have  seen  a  proof  that  used  a  single  triangle  as  an  illustration.  It  might  have  gone 
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something  like  this:  assume  lines  AB  and  CD  are  parallel,  and  that  two  other  line  segments 
EF  and  EG  cross  those  parallel  lines,  and  meet  on  AB  at  E.  Assume  also  that  the  alternate 
angles  for  any  line  crossing  parallel  lines  are  equal.  Assume  that  a  line  is  equivalent  to  two 
right  angles,  or  180  degrees.  Then,  in  the  following  picture,  b’=b,  c’=c,  and  b’+c’+a=180 
degrees.  Thus,  a+b+c=180  degrees. 


E 


Most  of  us  think  about  such  a  proof,  see  the  reasoning,  and  agree  with  it.  But  if  we  reflect 
for  a  moment,  we  should  see  that  it  is  quite  mysterious  why  such  a  proof  works.  That’s 
because,  it  aims  to  show  us  that  the  sum  of  the  interior  angles  of  any  triangle  is  the  same 
as  two  right  angles.  But  there  are  infinitely  many  triangles  (in  fact,  logicians  have  proved 
that  there  are  more  triangles  than  there  are  natural  numbers!).  So  how  can  it  be  that  this 
argument  proves  something  about  all  of  the  triangles?  Furthermore,  in  the  diagram  above, 
there  are  infinitely  many  different  sets  of  two  parallel  lines  we  could  have  used.  And  so  on. 

This  also  touches  on  the  case  that  we  saw  in  the  Meno.  Socrates  proves  that  the  area  of  a 
square  A  twice  as  big  as  square  B  does  not  simply  have  sides  twice  as  long  as  the  sides  of  B; 
rather,  each  side  of  A  must  be  the  length  of  the  diagonal  of  B.  But  he  and  the  boy  drew 
just  one  square  in  the  dirt.  And  it  won’t  even  be  properly  square.  How  can  they  conclude 
something  about  every  square  based  on  their  reasoning  and  a  crude  drawing? 

In  all  such  cases,  there  is  an  important  feature  of  the  relevant  proof.  Squares  come  in  many 
sizes,  triangles  come  in  many  sizes  and  shapes.  But  what  interests  us  in  such  proofs  is  all  and 
only  the  properties  that  all  triangles  have,  or  all  and  only  properties  that  all  squares  have. 
We  refer  to  a  triangle,  or  a  square,  that  is  abstract  in  a  strange  way:  we  draw  inferences 
about,  and  only  refer  to,  its  properties  that  are  shared  with  all  the  things  of  its  kind.  We  are 
really  considering  a  special,  generalized  instance. 

We  can  call  this  special  instance  the  “arbitrary  instance”.  If  we  prove  something  is  true  of 
the  arbitrary  triangle,  then  we  conclude  it  is  true  of  all  triangles.  If  we  prove  something  is 
true  of  the  arbitrary  square,  then  we  conclude  it  is  true  of  all  squares.  If  we  prove  something 
is  true  of  an  arbitrary  natural  number,  then  we  conclude  it  is  true  of  all  natural  numbers. 
And  so  on. 


14.3  Universal  derivation 

To  use  this  insight,  we  will  introduce  not  an  inference  rule,  but  rather  a  new  proof  method. 
We  will  call  this  proof  method  “universal  derivation”  or,  synonymously,  “universal  proof”. 
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We  need  something  to  stand  for  the  arbitrary  instance.  For  a  number  of  reasons,  it  is 
traditional  to  use  unbound  variables  for  this.  However,  to  make  it  clear  that  the  variable  is 
being  used  in  this  special  way,  and  that  the  well-formed  formula  so  formed  is  a  sentence, 
we  will  use  a  prime — that  is,  the  small  mark  — to  mark  the  variable.  Let  a  be  any  variable. 
Our  proof  method  thus  looks  like  this. 


|  0(aO 

Vct0(a) 

Where  a'  does  not  appear  in  any  open  proof  above  the  beginning  of  the  universal 
derivation. 

Remember  that  an  open  proof  is  a  subproof  that  is  not  completed. 

We  will  call  any  symbolic  term  of  this  form  (x',  f,  z'. . .)  an  “arbitrary  term”,  and  it  is  often 
convenient  to  describe  it  as  referring  to  the  arbitrary  object  or  arbitrary  instance.  But  there 
is  not  any  one  object  in  our  domain  of  discourse  that  such  a  term  refers  to.  Rather,  it  stands 
in  for  an  abstraction:  what  all  the  things  in  the  domain  of  discourse  have  in  common. 

The  semantics  of  an  arbitrary  instance  is  perhaps  less  mysterious  when  we  consider  the 
actual  syntactic  constraints  on  a  universal  derivation.  One  should  not  be  able  to  say 
anything  about  an  arbitrary  instance  a'  unless  one  has  done  universal  instantiation  of  a 
universal  claim.  No  other  sentence  should  allow  claims  about  a'.  For  example,  you  cannot 
perform  existential  instantiation  to  an  arbitrary  instance,  since  we  required  that  existential 
instantiation  be  done  to  special  indefinite  names  that  have  not  appeared  yet  in  the  proof. 
But  if  we  can  only  makes  claims  about  a'  using  universal  instantiation,  then  we  will  be 
asserting  something  about  a'  that  we  could  have  asserted  about  anything  in  our  domain  of 
discourse.  Seen  in  this  way,  from  the  perspective  of  the  syntax  of  our  proof,  the  universal 
derivation  hopefully  seems  very  intuitive. 

This  schematic  proof  has  a  line  where  we  indicate  that  we  are  going  to  use  a'  as  the 
arbitrary  object,  by  putting  a'  in  a  box.  This  is  not  necessary,  and  is  not  part  of  our  proof. 
Rather,  like  the  explanations  we  write  on  the  side,  it  is  there  to  help  someone  understand 
our  proof.  It  says,  this  is  the  beginning  of  a  universal  derivation,  and  a'  stands  for  the  arbitrary 
object.  Since  this  is  not  actually  a  line  in  the  proof,  we  need  not  number  it. 

We  can  now  prove  our  example  above  is  valid. 
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1.  Vx(Fx  — >  Gx)  premise 

2.  Vx(Gx— >■  Hx)  premise 


x7 


3.  Fa7 

4.  (Fa7  ->•  Ga7) 

5.  Ga7 

6.  (Ga7  Ha7) 

7.  Ha7 

8.  (Fa7  ->  Ha7) 

9.  Vx(Fx  — >  Hx) 


assumption  for  conditional  derivation 

universal  instantiation,  1 
modus  ponens,  4,  3 
universal  instantiation,  2 
modus  ponens,  6,  5 
conditional  derivation,  3-7 
universal  derivation,  3-8 


Remember  that  our  specification  of  the  proof  method  has  a  special  condition,  that  a'  must 
not  appear  earlier  in  an  open  proof  (a  proof  that  is  still  being  completed).  This  helps  us 
avoid  confusing  two  or  more  arbitrary  instances.  Here,  there  is  no  x'  appearing  above  our 
universal  derivation  in  an  open  proof  (in  fact,  there  is  no  other  arbitrary  instance  appearing 
in  the  proof  above  x'),  so  we  have  followed  the  rule. 

14.4  Two  useful  theorems:  quantifier  equivalence 

Our  definition  of  “theorem”  remains  the  same  for  the  first  order  logic  and  for  the 
propositional  logic:  a  sentence  that  can  be  proved  without  premises.  However,  we  now 
have  a  distinction  when  it  comes  to  the  semantics  of  sentences  that  must  be  true.  Generally, 
we  think  of  a  tautology  as  a  sentence  that  must  be  true  as  a  function  of  the  truth-functional 
connectives  that  constitute  that  sentence.  That  is,  we  identified  that  a  tautology  must  be 
true  by  making  a  truth  table  for  the  tautology.  There  are,  however,  sentences  of  the  first 
order  logic  that  must  be  true,  but  we  cannot  demonstrate  this  with  a  truth  table.  Here  is  an 
example: 


Vx(F  X  V  ->Fx) 

This  sentence  must  be  true.  But  we  cannot  show  this  with  a  truth  table.  Instead,  we 
need  the  concept  of  a  model  (introduced  briefly  in  section  17.6)  to  describe  this  property 
precisely.  But  even  with  our  intuitive  semantics,  we  can  see  that  this  sentence  must  be 
true.  For,  we  require  (in  our  restriction  on  predicates)  that  everything  in  our  domain  of 
discourse  either  is,  or  is  not,  an  F. 

We  call  a  sentence  of  the  first  order  logic  that  must  be  true,  “logically  true”.  Just  as  it  was  a 
virtue  of  the  propositional  logic  that  all  the  theorems  are  tautologies,  and  all  the  tautologies 
are  theorems;  it  is  a  virtue  of  our  first  order  logic  that  all  the  theorems  are  logically  true,  and 
all  the  logically  true  sentences  are  theorems.  Proving  this  is  beyond  the  scope  of  this  book, 
but  is  something  done  in  most  advanced  logic  courses  and  texts. 


Here  is  a  proof  that  Vx(Fx  v  -iFx). 


14.  Universal  derivation  153 


1.  ^(Fa7  V^Fa7) 

2.  ^Fa7 

3.  (Fa/  V  ->Fa/) 

4.  ->(Fa7  V  -iFa/) 

5.  Fa/ 

6.  (Fa/  V  — iFa/) 

7.  — i(Fa/  V  -iFa/) 

8.  (Fa7  V^Fa7) 

9.  Vx(Fa:  V  -iFx) 


assumption  for  indirect  derivation 
assumption  for  indirect  derivation 

addition,  2 
repeat,  1 

indirect  derivation,  2-4 
addition,  5 
repeat,  1 

indirect  derivation,  1-7 
universal  derivation,  1-8 


Let  us  consider  another  example  of  a  logically  true  sentence  that  we  can  prove,  and 
thus,  practice  universal  derivation.  The  following  sentence  is  logically  true. 


((Vx  (Fx  -»  Gx)  A  Vx  (Fx  -»  Hx))^  Vx  (Fx  -»  (Gx  AHx)) 


Here  is  a  proof.  The  formula  is  a  conditional,  so  we  will  use  conditional  derivation. 
However,  the  consequence  is  a  universal  sentence,  so  we  will  need  a  universal  derivation 
as  a  subproof. 


154  A  Concise  Introduction  to  Logic 


1.  (Vx( Fx  — >  Gx )  A  Vx(Fx  — >  Hx)) 


x! 


2.  Fa7 

3.  Vx(Fx  -A  Gx) 

4.  (Fa7  -A  Ga/) 

5.  Ga/ 

6.  Vx(Fx  — Hx) 

7.  (Fa7  -A  Ha7) 

8.  Ha7 

9.  (Ga7  A  Ha7) 

10.  (Fa7  -A  (Ga7  A  Ha7)) 

11.  Vx(Fx  -A  (Gx  A  Hx)) 

((Vx(Fx  -A  Gx)  A  Vx(Fx  — »  Hx))  -A 
Vx(Fx— >•  (GxA 


assumption  for  conditional  derivation 


assumption  for  conditional  derivation 

simplification,  1 
universal  instantiation,  3 
modus  ponens,  4,  2 
simplification,  1 
universal  instantiation,  6 
modus  ponens,  7,  2 
adjunction,  5,  8 

conditional  derivation,  2-9 

universal  derivation,  2-10 

conditional  derivation,  1-11 


Just  as  there  were  useful  theorems  of  the  propositional  logic,  there  are  many  useful  theorems 
of  the  first  order  logic.  Two  very  useful  theorems  concern  the  relation  between  existential 
and  universal  claims. 


(3xFx  iVx-iFx) 

(VxFx  i3x-iFx) 

Something  is  F  just  in  case  not  everything  is  not  F.  And,  everything  is  F  if  and  only  if  not 
even  one  thing  is  not  F. 

We  can  prove  the  second  of  these,  and  leave  the  first  as  an  exercise. 
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1.  VxFx 

2.  3  am Fx 

3.  -iF p 

4.  F p 

5.  Sx-^Fx 

6.  (VxFx  — »  -UomFx) 

7.  ->3:tmFx 

I  Ed 

8.  -i Fx ’ 

9.  3x->Fx 

10.  -fix-Tx 

11.  Fx’ 

12.  VxFx 

13.  (-i3x-iFx  — >  VxFx) 

14.  (VxFxo  ->3amFx)) 


assumption  for  conditional  derivation 
assumption  for  indirect  derivation 

existential  instantiation,  2 
universal  instantiation,  1 
indirect  derivation,  2-4 

conditional  derivation  1-5 
assumption  for  conditional  derivation 


assumption  for  indirect  derivation 

existential  generalization,  8 
repeat,  7 

indirect  derivation,  8-10 
universal  derivation,  8-11 
conditional  derivation  7-12 
bicondition,  6,  13 


14.5  Illustrating  invalidity 

Consider  die  following  argument: 

Vx(Hx^Gx) 


->Hd 


iG  d 

This  is  an  invalid  argument.  It  is  possible  that  the  conclusion  is  false  but  the  premises  are 
true. 

Because  we  cannot  use  truth  tables  to  describe  the  semantics  of  quantifiers,  we  have  kept 
the  semantics  of  the  quantifiers  intuitive.  A  complete  semantics  for  first  order  logic  is  called 
a  “model”,  and  requires  some  set  theory.  This  presents  a  difficulty:  we  cannot  demonstrate 
that  an  argument  using  quantifiers  is  invalid  without  a  semantics. 

Fortunately,  there  is  a  heuristic  method  that  we  can  use  that  does  not  require  developing  a 
full  model.  We  will  develop  an  intuitive  and  partial  model.  The  idea  is  that  we  will  come 
up  with  an  interpretation  of  the  argument,  where  we  ascribe  a  meaning  to  each  predicate, 
and  a  referent  for  each  term,  and  where  this  interpretation  makes  the  premises  obviously 
true  and  the  conclusion  obviously  false.  This  is  not  a  perfect  method,  since  it  will  depend 


156  A  Concise  Introduction  to  Logic 


upon  our  understanding  of  our  interpretation,  and  because  it  requires  us  to  demonstrate 
some  creativity.  But  this  method  does  illustrate  important  features  of  the  semantics  of  the 
first  order  logic,  and  used  carefully  it  can  help  us  see  why  a  particular  argument  is  invalid. 

It  is  often  best  to  create  an  interpretation  using  numbers,  since  there  is  less  vagueness  of 
the  meaning  of  the  predicates.  So  suppose  our  domain  of  discourse  is  the  natural  numbers. 
Then,  we  need  to  find  an  interpretation  of  the  predicates  that  makes  the  first  two  lines  true 
and  the  conclusion  false.  Here  is  one: 

Hx:  x  is  evenly  divisible  by  2 

Gx:  x  is  evenly  divisible  by  1 

d:  3 

The  argument  would  then  have  as  premises:  All  numbers  evenly  divisible  by  2  are  evenly 
divisible  by  1;  and,  3  is  not  evenly  divisible  by  2.  These  are  both  true.  But  the  conclusion 
would  be:  3  is  not  evenly  divisible  by  1.  This  is  false.  This  illustrates  that  the  argument 
form  is  invalid. 

Let  us  consider  another  example.  Here  is  an  invalid  argument: 

Vx(Fx->Gx) 

F  a 


G  b 

We  can  illustrate  that  it  is  invalid  by  finding  an  interpretation  that  shows  the  premises 
true  and  the  conclusion  false.  Our  domain  of  discourse  will  be  the  natural  numbers.  We 
interpret  the  predicates  and  names  in  the  following  way: 

Fx:  x  is  greater  than  10 

Gx:  x  is  greater  than  5 

a:  15 

b:  2 

Given  this  interpretation,  the  argument  translates  to:  Any  number  greater  than  10  is  greater 
than  5;  15  is  greater  than  10;  therefore,  2  is  greater  than  5.  The  conclusion  is  obviously 
false,  whereas  the  premises  are  obviously  true. 

In  this  exercise,  it  may  seem  strange  that  we  would  just  make  up  meanings  for  our 
predicates  and  names.  However,  as  long  as  our  interpretations  of  the  predicates  and  names 
follow  our  rules,  our  interpretation  will  be  acceptable.  Recall  the  rules  for  predicates  are 
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that  they  have  an  arity,  and  that  each  predicate  of  arity  n  is  true  or  false  (never  both,  never 
neither)  of  each  n  things  in  the  domain  of  discourse.  The  rule  for  names  is  that  they  refer 
to  only  one  object. 

This  illustrates  an  important  point.  Consider  a  valid  argument,  and  try  to  come  up  with 
some  interpretation  that  makes  it  invalid.  You  will  find  that  you  cannot  do  it,  if  you  respect 
the  constraints  on  predicates  and  names.  Make  sure  that  you  understand  this.  It  will  clarify 
much  about  the  generality  of  the  first  order  logic.  Take  a  valid  argument  like: 

Vx(Fx^Gx) 

F  a 


Gu 

Come  up  with  various  interpretations  for  a  and  for  F  and  G.  You  will  find  that  you  cannot 
make  an  invalid  argument. 

In  summary,  an  informal  model  used  to  illustrate  invalidity  must  have  three  things: 

1.  a  domain  of  discourse; 

2.  an  interpretation  of  the  predicates;  and 

3.  an  interpretation  of  the  names. 

If  you  can  find  such  an  informal  model  that  makes  the  premises  obviously  true  and 
the  conclusion  obviously  false,  you  have  illustrated  that  the  argument  is  invalid.  This 
may  take  several  tries:  you  can  also  sometimes  come  up  with  interpretations  for  invalid 
arguments  that  make  all  the  premises  and  the  conclusion  true;  this  is  not  surprising,  when 
you  remember  the  definition  of  valid  (that  necessarily,  if  the  premises  are  true  then  the 
conclusion  is  true — in  other  words,  it  is  not  enough  that  the  conclusion  just  happens  to  be 
true). 


14.6  Problems 

1.  Prove  the  following.  These  will  require  universal  derivation.  (For  the  third,  remember 
that  the  variables  used  in  quantifiers  are  merely  used  to  indicate  the  place  in  the 
following  expression  that  is  being  bound.  So,  if  we  change  the  variable  nothing  else 
changes  in  our  proof  or  use  of  inference  rules.)  The  last  three  are  challenging.  For 
these  last  three  problems,  do  not  use  the  quantifier  negation  rules. 

a.  Premises:  VxFx,  Vx  (Fx  Gx).  Conclusion:  VxGx. 

b.  Premises:  Vx(Fx  -»  Gx).  Conclusion:  Vx(iGx  -» iFx). 

c.  Premises:  Vx(Fx  Hx),  Vy(Hy  Gy).  Conclusion:  V^(Fz  G^r). 

d.  Conclusion:  (Vx(-iFx  v  Gx)  -»  Vx(Fx  -»  Gx)). 

e.  Conclusion:  (Vx(Fx  ^Gx)  -»  (VxFx  <-»  VxGx)). 

f.  Conclusion:  (-GxFx  Vx-iFx). 


158  A  Concise  Introduction  to  Logic 

g.  Conclusion:  (-iVxFx  3x-iFx). 

h.  Conclusion:  (3xFx  iVx-iFx). 

2.  Create  a  different  informal  model  for  each  of  the  following  arguments  to  illustrate  that 
it  is  invalid. 

a.  Premises:  Vx(Fx  -»  Gx),  ->Ga.  Conclusion:  -iF b. 

b.  Premises:  Vx(F  xv  Gx),  -iF a.  Conclusion:  Gb. 

c.  Premises:  Vx(Fx  -»  Gx),  3xFx.  Conclusion:  Gc. 

3.  In  normal  colloquial  English,  write  your  own  valid  argument  with  at  least  two 
premises  and  with  a  conclusion  that  is  a  universal  statement.  Y our  argument  should 
just  be  a  paragraph  (not  an  ordered  list  of  sentences  or  anything  else  that  looks  like 
formal  logic).  Translate  it  into  first  order  logic  and  prove  it  is  valid. 

4.  Do  we  have  free  will?  Much  of  the  work  that  philosophers  have  done  to  answer  this 
question  focuses  on  trying  to  define  or  understand  what  free  will  would  be,  and 
understand  the  consequences  if  we  do  not  have  free  will.  Doubts  about  free  will  have 
often  been  raised  by  those  who  believe  that  physics  will  ultimately  explain  all  events 
using  deterministic  laws,  so  that  everything  had  to  happen  one  way.  Here  is  a 
simplified  version  of  such  an  argument. 

Every  event  is  caused  by  prior  events  by  way  of  natural  physical  laws.  Any  event 
caused  by  prior  events  by  way  of  natural  physical  laws  could  not  have  happened 
otherwise.  But,  if  all  events  could  not  have  happened  otherwise,  then  there  is  no 
freely  willed  event.  We  conclude,  therefore,  that  there  are  no  freely  willed  events. 

Symbolize  this  argument  and  prove  it  is  valid.  You  might  consider  using  the  following 
predicates: 

Fx:  x  is  an  event. 

Gx:  x  is  caused  by  prior  events  by  way  of  natural  physical  laws. 

Hx:  x  could  have  happened  otherwise. 

Ix:  x  is  a  freely  willed  event. 

(Hint:  this  argument  will  require  universal  derivation.  The  conclusion  can  be  had  using 
modus  ponens,  if  you  can  prove:  all  events  could  not  have  happened  otherwise.)  Do  you 
believe  that  this  argument  is  sound? 


[12]  These  passages  are  adapted  from  the  Benjamin  Jowett  translation  of  the  Meno. 
Versions  of  this  translation  are  available  for  free  on  the  web.  Students  hoping  to  read  other 
works  by  Plato  should  consider  Cooper  and  Hutchinson  (1997). 


15.  Relations,  functions,  identity, 
and  multiple  quantifiers 


15.1  Relations 

We  have  developed  a  first  order  logic  that  is  sufficient  to  describe  many  things.  The  goal 
of  this  chapter  is  to  discuss  ways  to  extend  and  apply  this  logic.  We  will  introduce  relations 
and  functions,  make  some  interesting  observations  about  identity,  and  discuss  how  to  use 
multiple  quantifiers. 

Recall  that  if  we  have  a  predicate  of  arity  greater  than  one,  we  sometimes  call  that  a 
“relation”.  An  arity  one  predicate  like  "...  is  tall”  does  not  relate  things  in  our  domain  of 
discourse.  Rather,  it  tells  us  about  a  property  of  a  thing  in  our  domain  of  discourse.  But 
an  arity  two  predicate  like  "...  is  taller  than  ...”  relates  pairs  of  things  in  our  domain  of 
discourse. 

More  generally,  we  can  think  of  a  relation  as  a  set  of  ordered  things  from  our  domain  of 
discourse.  An  arity  two  relation  is  thus  a  collection  of  ordered  pairs  of  things;  the  relation 
“. .  .is  taller  than. . .”  would  be  all  the  ordered  pairs  of  things  where  the  first  is  taller  than  the 
second.  The  predicate  “. . .  is  taller  than. . .”  would  be  true  of  all  these  things.  The  relation 
"...  sits  between  ...  and  ...”  would  be  the  collection  of  all  the  triples  of  things  where  the 
first  sat  between  the  second  and  third.  The  predicate  "...  sits  between  ...  and  ...”  would 
be  true  of  all  of  these  things. 

Logicians  have  developed  a  host  of  useful  ways  of  talking  about  relations,  especially 
relations  between  just  two  things.  We  can  illustrate  this  with  an  example.  Hospitals  and 
other  medical  treatment  facilities  often  need  blood  for  transfusions.  But  not  any  kind  of 
blood  can  do.  One  way  to  classify  kinds  of  blood  is  using  the  ABO  system.  This  divides 
kinds  of  blood  into  four  groups:  A,  B,  AB,  and  O.  This  classification  describes  antigens  on 
the  surface  of  the  blood  cells.  It  is  a  very  useful  classification,  because  some  people  have  an 
immune  system  that  will  not  tolerate  the  antigens  on  other  kinds  of  blood.  This  tolerance 
is  determined  by  one’s  blood  group. 

Those  with  type  O  blood  can  give  blood  to  anyone,  without  causing  an  immune  reaction. 
Those  with  type  A  can  give  blood  to  those  with  type  A  and  type  AB.  Those  with  type  B 
can  give  blood  to  those  with  type  B  and  type  AB.  And  those  with  type  AB  can  only  give 
to  type  AB.  Let  arrows  mean,  can  be  given  to  people  with  this  blood  type  without  causing  an 
allergic  reaction  in  the  following  diagram: 


159 
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Notice  a  number  of  things.  First,  every  blood  type  can  share  blood  with  people  of  that 
blood  type.  But  it  is  not  always  the  case  that  if  I  can  share  blood  with  you,  you  can  share 
blood  with  me:  it  could  be  that  I  am  type  O  and  you  are  type  B.  Also,  if  I  can  share  blood 
with  you,  and  you  can  share  blood  with  Tom,  then  I  can  share  blood  with  Tom. 

The  first  feature  of  relations  is  called  “reflexive”.  For  any  relation  the  relation  is 

reflexive  if  and  only  if: 

Vx<t>(xx) 

Examples  of  reflexive  relations  in  English  include  “...is  as  old  as...”.  Each  person  is  as  old 
as  herself.  A  relation  that  is  not  reflexive  is,  “...is  older  than  ...”.  No  person  is  older  than 
herself. 

For  any  relation  5>,  the  relation  is  symmetric  if  and  only  if: 

VxVy(5>(x  y)  -»  $>(y  x)) 

Examples  of  symmetric  relations  in  English  include  “...is  married  to...”.  In  our  legal 
system  at  least,  if  Pat  is  married  to  Chris,  then  Chris  is  married  to  Pat. 

Finally,  call  a  relation  “transitive”  if  and  only  if 

VxVyVz((<t>(x  y)A$>(y  z))  -»  5>(x  z)) 

Examples  of  transitive  relations  in  English  include  .  .is  older  than. . If  Tom  is  older  than 
Steve,  and  Steve  is  older  than  Pat,  then  Tom  is  older  than  Pat. 

Return  now  to  our  example  of  blood  types.  We  introduce  the  following  translation  key: 
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Gxy:  x  can  give  blood  to  y  without  causing  an  immune  reaction 
It  is  the  case  that 
VxGxx 

And  so  we  know  that  the  relation  G  is  reflexive.  A  person  with  type  O  blood  can  give 
blood  to  himself,  a  person  with  type  AB  blood  can  give  blood  to  herself,  and  so  on.  (People 
do  this  when  they  store  blood  before  a  surgery.)  Is  the  relation  symmetric?  Consider 
whether  the  following  is  true: 

VxVy(Gxy  -»  Gyx) 

A  moment’s  reflection  reveals  this  isn’t  true.  A  person  of  type  O  can  give  blood  to  a  person 
of  type  AB,  but  the  person  with  type  AB  blood  cannot  give  blood  to  the  person  with  type 
O,  without  potentially  causing  a  reaction.  So  G  is  not  symmetric. 

Finally,  to  determine  if  G  is  transitive,  consider  whether  the  following  is  true. 

VxVyVz((Gxy  A  Gyz)  -»  Gxz) 

A  person  with  type  O  blood  can  give  blood  to  a  person  with  type  A  blood,  and  that  person 
can  give  blood  to  someone  of  type  AB.  Does  it  follow  that  the  person  with  type  O  can  give 
blood  to  the  person  with  AB?  It  does.  And  similarly  this  is  so  for  all  other  possible  cases. 
G  is  a  transitive  relation. 


15.2  Functions 

A  function  is  a  kind  of  relation  between  two  or  more  things.  What  all  functions  have  in 
common  is  that  they  relate  some  specific  number  of  things  to  precisely  one  thing.  A  familiar 

example  from  mathematics  might  be  the  squaring  function.  We  write  for  this,  n2.  This 
takes  one  number,  say  7,  and  then  relates  it  to  exactly  one  other  number,  49.  Or  addition  is 
a  function  that  takes  two  numbers  and  relates  them  to  exactly  one  other  number,  their  sum. 

The  idea  of  a  function  is  very  general,  and  extends  beyond  mathematics.  For  example,  each 
of  the  following  could  be  a  function  (if  certain  things  are  assumed  beforehand): 

the  mother  of. . . 

the  father  of. . . 

Think  of  how  you  could  use  something  like  this  in  our  logical  language.  You  could  say, 
“The  father  of  Tom  is  Canadian”.  But  now,  who  is  Canadian?  Not  Tom.  Tom’s  father  is. 
In  this  sentence,  “the  father  of...”  acts  as  a  function.  It  relates  a  person  to  another  person. 
And,  in  our  predicate,  “the  father  of  Tom”  acts  like  a  name,  in  that  it  refers  to  one  thing. 

Functions  have  an  arity.  Addition  is  an  arity  two  function;  it  takes  two  objects  in  order 
to  form  a  symbolic  term.  But  in  order  to  be  a  function,  the  resulting  symbolic  term  must 
always  refer  to  only  one  object.  (This  rule  gets  broken  a  lot  in  mathematics,  where  some 
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relations  are  called  “functions”  but  can  have  more  than  one  output.  This  arises  because 
in  circumscribed  domains  of  discourse,  those  operations  are  functions,  and  then  they  get 
applied  in  new  domains  but  are  still  called  “functions”.  Thus,  the  square  root  function  is 
a  function  when  we  are  studying  the  natural  numbers,  but  once  we  introduce  negative 
numbers  it  no  longer  is  a  function.  But  mathematicians  call  it  a  “function”  because  it  is  in 
limited  domains  a  function,  and  because  the  diverse  output  is  predictable  in  various  ways. 
Logic  is  the  only  field  where  one  earns  the  right  to  call  mathematicians  sloppy.) 

Functions  are  surprisingly  useful.  Computers,  for  example,  can  be  understood  as  function 
machines,  and  programming  can  be  usefully  described  as  the  writing  of  functions  for  the 
computer.  Much  of  mathematics  is  concerned  with  studying  functions,  and  they  often 
prove  useful  for  studying  other  things  in  mathematics  that  are  not  themselves  functions. 

We  can  add  functions  to  our  logical  language.  We  will  let  f,  g,  h,  ...  be  functions.  Each 
function,  as  noted,  has  an  arity.  A  function  of  arity  n  combined  with  n  symbolic  terms  is  a 
symbolic  term. 

Thus,  to  make  a  key  to  translate  the  sentence  above,  we  can  have: 

Kx:  x  is  Canadian, 
fx:  the  father  of  x. 
a\  Tom 
b:  Steve 

(Obviously,  we  are  assuming  that  each  person  has  only  one  father.  Arguably  that  is  only 
one  use  of  the  word  “father,”  but  our  goal  here  is  to  create  a  familiar  example,  not  to  take 
sides  in  any  issue  about  family  relations.  So  we  will  allow  the  assumption  just  to  make  our 
point.)  Using  that  key,  the  following  would  mean  “Tom  is  Canadian”: 

Ka 

And  the  following  would  mean  “The  father  of  Tom  is  Canadian”: 

Kf a 

We  can  also  say  something  like,  “Tom’s  paternal  grandfather  is  Canadian”: 

Kffa 

Or  even,  “Tom’s  paternal  great-grandfather  is  Canadian”: 

Kfffi? 

That  works  because  the  father  of  the  father  of  Tom  is  the  paternal  grandfather  of  Tom, 
which  then  is  a  symbolic  term,  and  we  can  apply  the  function  to  it.  Recall  that,  when  a  rule 
can  be  applied  repeatedly  to  its  product,  we  call  this  “recursion”. 
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15.3  Identity 

There  is  one  predicate  that  has  always  been  enormously  useful,  and  which  in  most  logical 
systems  is  singled  out  for  special  attention.  This  is  identity. 

In  English,  the  word  “is”  is  multiply  ambiguous,  and  we  must  sort  out  identity  from 
predication  and  existence.  For  example,  consider  the  following  sentences: 

Malcolm  X  is  a  great  orator. 

Malcolm  X  is  Malcolm  Little. 

Malcolm  X  is. 

The  last  example  is  not  very  common  in  English  usage,  but  it  is  grammatical.  Here  we  see 
the  “is”  of  existence.  The  sentence  asserts  that  Malcolm  X  exists.  In  the  first  sentence,  we 
would  treat  .is  a  great  orator”  as  an  arity  one  predicate.  The  “is”  is  part  of  the  predicate, 
and  in  our  logic,  cannot  be  distinguished  from  the  predicate.  But  the  second  case  uses  the 
“is”  of  identity.  It  asserts  that  Malcolm  X  and  Malcolm  Little  are  the  same  thing. 

Because  it  is  so  common  to  use  the  symbol  “=”  for  identity,  we  will  use  it,  also.  Strictly 
speaking,  our  syntax  requires  prefix  notation.  But  for  any  language  we  create,  we  could 
introduce  an  arity  two  predicate 

Ixy:  x  is  identical  to  y 

And  then  we  could  say,  whenever  we  write  “a=p”  we  really  mean  “lap”. 

Note  identity  describes  a  relation  that  is  reflexive,  symmetric,  and  transitive.  Everything  is 
identical  to  itself.  If  a=b  then  b=a.  And  if  a=b  and  b=c  then  a=c. 

One  special  feature  of  identity  is  that  we  know  that  if  two  things  are  identical,  then 
anything  true  of  one  is  true  of  the  other.  In  fact,  the  philosopher  Leibniz  defined  identity 
with  a  principle  that  we  call  Leibniz’s  Law:  a  and  b  are  identical  if  and  only  if  they  have  all 
and  only  the  same  properties.  Our  logic  must  take  identity  as  a  primitive,  however,  because 
we  have  no  way  in  our  logic  of  saying  “all  properties”  (this  is  what  “first  order”  in  “first 
order  logic”  means:  our  quantifiers  have  only  particular  objects  in  their  domain). 

Leibniz’s  insight,  however,  suggests  an  inference  rule.  If  a  and  p  are  symbolic  terms,  and 
<*>(«)  means  that  5>  is  a  formula  in  which  a  appears,  we  can  say 
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Where  we  replace  one  or  more  occurrences  of  a  in  4>  with  |3.  We  can  call  this  rule, 
“indiscemibility  of  identicals”.  We  sometimes  also  call  this,  “substitution  of  identicals”. 

This  is  an  interesting  kind  of  rule.  Some  logicians  would  call  this  a  “non-logical  rule”.  The 
reason  is,  we  know  it  is  proper  because  we  know  the  meaning  of  “=”.  Unlike,  for  example, 
modus  ponens,  which  identifies  a  logical  relation  between  two  kinds  of  sentences,  this  rule 
relies  not  on  the  semantics  of  a  connective,  but  rather  on  the  meaning  of  a  predicate.  This 
notion  of  “non-logical”  is  a  term  of  art,  but  it  does  seem  profound.  Adding  such  rules  to 
our  language  can  strengthen  it  considerably. 

Adding  identity  to  our  language  will  allow  us  to  translate  some  expressions  that  we  would 
be  unable  to  translate  otherwise.  Consider  our  example  above,  for  functions.  How  would 
we  translate  the  expression,  “Steve  is  the  father  of  Tom”?  We  could  add  to  our  language  a 
predicate,  "...  is  the  father  of  . . .”.  However,  it  is  interesting  that  in  this  expression  (“Steve 
is  the  father  of  Tom”),  the  “is”  is  identity.  A  better  translation  (using  the  key  above)  would 
be: 


b=(a 


We  can  also  say  things  like,  “The  father  of  Steve  is  the  paternal  grandfather  of  Tom”: 

fb=Ka 

Consider  now  a  sentence  like  this:  someone  is  the  father  of  Tom.  Again,  if  we  had  a 
predicate  for  "...  is  the  father  of . . .”,  we  could  just  say,  there  is  something  that  is  the  father 
of  Tom.  But  given  that  we  have  a  function  for  “the  father  of  x”  we  could  also  translate  this 
phrase  as: 


3x  x=£a 

We  can  see  from  these  examples  that  there  are  interesting  parallels  between  relations 
(including  functions)  and  predicates.  To  represent  some  kind  of  function,  we  can  introduce 
a  function  into  our  language,  which  acts  as  a  special  kind  of  symbolic  term,  but  it  is  also 
possible  to  identify  a  predicate  that  is  true  of  all  and  only  those  things  that  the  function 
relates.  Nonetheless,  we  must  be  careful  to  distinguish  between  predicates,  which  when 
combined  with  the  appropriate  number  of  names  form  a  sentence;  and  functions,  which 
when  combined  with  the  appropriate  number  of  other  terms  are  symbolic  terms.  In  our 
logic,  treating  predicates  like  functions  (that  is,  taking  them  as  symbolic  terms  for  other 
predicates)  will  create  nonsense. 

Finally,  we  had  as  an  example  above,  the  sentence  “Malcolm  X  is”.  This  is  equivalent  to 
“Malcolm  X  exists”.  Let  c  mean  Malcolm  X.  Identity  allows  us  to  express  this  sentence.  We 
say  that  there  is  at  least  one  thing  that  is  Malcolm  X: 


3x  x=c 
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15.4  Examples  using  multiple  quantifiers 

We  have  only  just  begun  to  explore  the  power  of  quantifiers.  Consider  now  the  following 
sentences: 

Every  number  is  greater  than  or  equal  to  some  number. 

Some  number  is  greater  than  or  equal  to  every  number. 

Every  number  is  less  than  or  equal  to  some  number. 

Some  number  is  less  than  or  equal  to  every  number. 

Depending  upon  our  domain  of  discourse,  some  of  these  sentences  are  true,  and  some  of 
them  are  false.  Can  we  represent  them  in  our  logical  language? 

Suppose  that  we  introduce  an  arity  two  predicate  for  “greater  than  or  equal  to”: 

Gxy:  x  is  greater  than  or  equal  to  y 

We  can  follow  tradition,  and  use  “>”.  Thus,  when  we  write  “a>[3”  we  understand  that  this 
is  “Gap”.  Let  us  also  assume  that  our  domain  of  discourse  is  the  natural  numbers.  That  is, 
we  are  talking  only  about  1,  2,  3,  4. . .. 

We  can  see  now  how  to  take  a  first  step  toward  expressing  these  sentences.  If  we  write: 

x>y 

We  have  said  that  x  is  greater  than  or  equal  to  y.  We  have  used  quantifiers  to  say  “all”,  and 
we  can  write 

Vx  x>y 

Which  says  that  every  number  is  greater  than  or  equal  to  y.  But  how  will  we  capture  the 
sentences  above?  We  will  need  to  use  multiple  quantifiers.  To  say  that  “every  number  is 
greater  than  some  number”,  we  will  write 

Vx3y  x>y 

This  raises  important  questions  about  how  to  interpret  multiple  quantifiers.  Using  multiple 
quantifiers  expands  the  power  of  our  language  enormously.  ELowever,  we  must  be  very 
careful  to  understand  their  meaning. 

Consider  the  following  two  sentences,  which  will  use  our  key  above. 

Vx3y  x>y 
3  yVx  x>y 

Do  they  have  the  same  meaning? 
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As  we  understand  the  semantics  of  quantifiers,  we  will  say  that  they  do  not.  The  basic 
idea  is  that  we  read  the  quantifiers  from  left  to  right.  Thus,  the  first  sentence  above  should 
be  translated  to  English  as  “Every  number  is  greater  than  or  equal  to  some  number”.  The 
second  sentence  should  be  translated,  “Some  number  is  less  than  or  equal  to  every  number”. 
They  have  very  different  meanings.  Depending  upon  our  domain  of  discourse,  they  could 
have  different  truth  values.  For  example,  if  we  use  the  natural  numbers  as  our  domain  of 
discourse,  the  first  sentence  is  true  and  the  second  sentence  is  true.  However,  if  we  used  the 
integers  for  our  domain  of  discourse,  so  that  we  included  negative  numbers,  then  the  first 
sentence  is  true  but  the  second  sentence  is  false. 

It  may  be  helpful  to  think  of  multiple  quantifiers  in  the  following  way.  If  we  were  to 
instantiate  the  quantifiers,  then,  we  would  work  from  left  to  right.  Thus,  the  first  sentence 
says  something  like,  pick  any  number  in  our  domain  of  discourse,  then  there  will  be  at 
least  one  number  in  our  domain  of  discourse  that  is  less  than  or  equal  to  that  first  number 
that  you  already  picked.  The  second  sentence  says  something  quite  different:  there  is  at 
least  one  number  in  our  domain  of  discourse  such  that,  if  you  pick  that  number,  then  any 
number  in  our  domain  of  discourse  is  greater  than  or  equal  to  it. 

With  this  in  mind,  we  are  now  able  to  translate  the  four  phrases  above.  We  include  the 
English  with  the  translation  to  avoid  any  confusion. 

Every  number  is  greater  than  or  equal  to  some  number. 

Vx3 y  x>y 

Some  number  is  greater  than  or  equal  to  every  number. 

3  xVy  x>y 

Every  number  is  less  than  or  equal  to  some  number. 

Vx3 y  y>x 

Some  number  is  less  than  or  equal  to  every  number. 

3  xVy  y>x 

As  we  noted  above,  the  truth  value  of  these  sentences  can  change  if  we  change  our  domain 
of  discourse.  If  our  domain  of  discourse  is  the  natural  numbers,  then  only  the  second 
sentence  is  false;  this  is  because  for  the  natural  numbers  there  is  a  least  number  (l),  but 
there  is  no  greatest  number.  But  if  our  domain  of  discourse  is  the  integers,  then  the  second 
and  fourth  sentences  are  false.  This  is  because  with  the  negative  numbers,  there  is  no  least 
number:  you  can  always  find  a  lesser  negative  number. 

Hopefully  it  becomes  very  clear  now  why  we  need  the  possibility  of  a  number  of  variables 
for  our  quantifiers.  We  could  not  write  the  expressions  above  if  we  did  not  have  discernibly 
distinct  variables  to  allow  different  quantifiers  to  bind  different  locations  in  the  predicate. 
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15.5  Capturing  specific  quantities 

It  can  be  interesting  to  see  a  powerful  use  of  multiple  quantifiers.  In  1905,  the  British 
philosopher  Bertrand  Russell  (1872-1970)  published  a  very  insightful  and  influential  paper, 
“On  Denoting”.  This  paper,  and  additional  work  by  Russell  that  followed  it,  made  brilliant 
use  of  a  series  of  problems  in  logic  and  language  that  perplexed  Russell.  Russell  was 
concerned  about  a  number  of  puzzles  that  arise  around  phrases  like,  “The  current  President 
of  the  United  States. . Such  a  phrase  seems  to  act  like  a  name,  and  yet  a  number  of  strange 
outcomes  arise  if  we  treat  it  as  a  name. 

First,  suppose  we  say 

It  is  not  the  case  that  Sam  is  bald. 

Now  suppose  that  anyone  who  is  not  bald  has  hair.  Then,  we  could  reason  that: 

Sam  has  hair. 

That  seems  correct,  if  we  grant  the  premise  that  all  those  who  are  not  bald  have  hair.  But 
now  consider  the  following  sentence. 

It  is  not  the  case  that  the  present  King  of  France  is  bald. 

By  the  same  reasoning,  this  would  seem  to  entail  that: 

The  present  King  of  France  has  hair. 

But  that’s  not  right.  There  is  no  present  King  of  France! 

It  gets  worse.  Let  us  assert  that 

The  present  King  of  France  does  not  exist. 

This  seems  to  pick  out  a  thing,  the  present  King  of  France,  and  ascribe  to  it  a  property,  not 
existing.  After  all,  in  our  logical  language,  each  name  must  refer  to  something.  But  if  we 
can  pick  out  that  thing  in  order  to  describe  it  as  not  existing,  does  it  not  exist?  That  is,  is 
there  not  a  thing  to  which  the  term  refers? 

Some  philosophers  indeed  argued  that  every  term,  even  in  a  natural  language,  must  have 
a  thing  that  it  refers  to.  The  philosopher  Alexius  Meinong  (1853-1920),  for  example, 
proposed  that  every  name  has  a  referent  that  has  being,  but  that  existence  was  reserved  for 
particular  actual  objects.  This  is  very  strange,  when  you  consider  it:  it  means  that  “the 
round  square”  refers  to  something,  a  something  that  has  being,  but  that  lacks  existence. 
Russell  thought  this  a  terrible  solution,  and  wanted  to  find  another. 

Russell  uses  a  different  example  to  illustrate  a  third  problem.  Suppose  that 

George  IV  wished  to  know  whether  Scott  was  the  author  of  Waverley. 

Here  Russell  raises  a  problem  related  to  one  that  the  mathematician  Gottlob  Frege  had 
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already  observed.  Namely,  if  Scott=the  author  of  Waverley,  then  one  might  suppose  that 
we  could  substitute  “Scott”  where  we  see  “the  author  of  Waverley”  and  get  a  sentence 
that  has  the  same  truth  value.  That  is,  we  introduced  above  a  rule — indiscernibility  of 
identicals — that,  if  applied  here,  should  allow  us  to  replace  “the  author  of  Waverley”  with 
“Scott”  if  Scott=the  author  of  Waverley.  But  that  fails:  it  is  not  the  case  that 

George  IV  wished  to  know  whether  Scott  was  Scott. 

George  IV  already  knows  that  Scott  is  Scott. 

Russell  put  forward  a  brilliant  solution  to  these  puzzles.  He  developed  an  analysis  of  some 
English  phrases  into  a  logical  form  that  is  rather  different  than  we  might  expect.  For 
example,  he  argues  the  proper  form  to  translate  “It  is  not  the  case  that  the  present  King  of 
France  is  bald”  is  something  like  this.  Let  “Gx”  mean  “x  is  the  present  king  of  France”  and 
“Hx”  mean  “x  is  bald”,  then  this  sentence  is  translated: 

3x((Gx  A  Hx)  A  Vy(Gy  -»  x=y)) 

This  says  there  is  something — call  it  x  for  now — that  is  the  present  king  of  France,  and  that 
thing  is  bald,  and  if  anything  is  the  present  king  of  France  it  is  identical  to  x.  This  second 
clause  is  a  way  of  saying  that  there  is  only  one  king  of  France,  which  is  how  Russell  captures 
the  meaning  of  “the”  in  “the  present  king  of  France”. 

This  sentence  is  false,  because  there  is  no  present  king  of  France.  But  to  deny  that  the 
present  king  of  France  is  bald  is  to  assert  rather  that 

3x((Gx  A  -iHx)  A  Vy(Gy  -»  x=y)) 

This  sentence  is  false  also.  It  cannot  be  used  to  conclude  that  there  is  a  present  king  of 
France  who  is  hirsute. 

Similar  quick  work  can  be  done  with  the  puzzle  about  existence.  “The  present  king  of 
France  does  not  exist”  is  equivalent  to  “It  is  not  the  case  that  the  present  king  of  France 
exists”  and  this  we  translate  as: 

i3x(Gx  A  Vy(Gy  -»  x=y)) 

Note  that  there  is  no  need  in  this  formula  for  a  name  that  refers  to  nothing.  There  is  no 
name  in  this  formula. 

Finally,  when  George  IV  wished  to  know  whether  Scott  was  the  author  of  Waverley,  we  can 
let  “a”  stand  for  “Scott”,  and  “Wx”  mean  “x  authored  Waverley”,  and  now  assert  that  what 
the  king  wanted  to  know  was  whether  the  following  is  true: 

3x((Wx  A  Vy(Wy  -»  x=y))  A  x=a) 

In  this,  the  part  of  the  formula  that  captures  the  meaning  of  “the  author  of 
Waverley”  requires  no  name,  and  so  there  is  no  issue  of  applying  the  indiscernibility  of 
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identicals  rule.  (Remember  that  the  indiscernibility  of  identicals  rule  allows  the  replacement 
of  a  symbolic  term  with  an  identical  symbolic  term.  In  this  formula,  there  is  no  symbolic 
term  for  “the  author  of  Waverley ”,  and  so  even  if  Scott  is  the  author  of  Waverley,  the 
indiscernibility  of  identicals  rule  cannot  be  applied  here.) 

Russell  has  done  something  very  clever.  He  found  a  way  to  interpret  a  phrase  like  “the 
present  king  of  France”  as  a  complex  logical  formula;  such  a  formula  can  be  constructed 
so  that  it  uses  no  names  to  capture  the  meaning  of  the  phrase.  It  is  an  interesting  question 
whether  Russell’s  analysis  should  be  interpreted  as  describing,  in  some  sense,  what  is  really 
inside  our  minds  when  we  use  a  phrase  like  “The  present  king  of  France.”  That’s  perhaps 
an  issue  for  cognitive  scientists  to  settle.  Our  interest  is  that  Russell  inspires  a  new  and 
flexible  way  to  use  first  order  logic  to  understand  possible  interpretations  of  these  kinds  of 
utterances. 

Russell’s  translations  also  suggest  a  surprising  possibility:  perhaps  many  names,  or  even  all 
names,  are  actually  phrases  like  these  that  are  uniquely  true  of  one  and  only  one  thing.  That 
was  of  interest  to  philosophers  who  wanted  to  explain  the  nature  of  reference;  it  suggests 
that  reference  could  be  explained  using  the  notion  of  a  complex  predicate  expression 
being  true  of  one  thing.  That  is  a  radical  suggestion,  and  one  that  Russell  developed  and 
defended.  He  proposed  that  the  only  names  were  the  very  basic  primitive  “this”  and  “that”. 
All  other  natural  language  names  could  then  be  analyzed  into  complex  phrases  like  those 
above.  This  is  an  issue  for  the  philosophy  of  language,  and  we  will  not  consider  it  further 
here. 

Another  benefit  of  Russell’s  translation  is  that  it  illustrates  how  to  count  with  the  quantifier. 
This  is  of  great  interest  to  our  logic.  Any  sentence  of  the  form  “there  is  only  one  thing 
that  is  <J>”  can  be  translated: 

3x(5>(x)AVy($(y)  -»  x=y)) 

Russell’s  insight  is  that  if  only  one  thing  is  $>,  then  anything  that  turns  out  to  be  $  must  be 
the  same  one  thing. 

A  little  ingenuity  shows  that  we  can  use  his  insight  to  say,  there  are  exactly  two  things  that 
are  It  might  be  helpful  at  first  to  separate  out  “there  are  at  least  two”  and  “there  are  at 
most  two”.  These  are: 

3x3 y  (($(x)A$(y))  A  ->x  =y) 

VxVyVz  (((ft>(x)A(t,(y))A$(z))  -»  ((x=y  v  x=z)  v  y=z)) 

The  first  sentence  says,  there  exists  a  thing  x  and  a  thing  y  such  that  x  has  property  and  y 
has  property  $  and  x  and  y  are  not  the  same  thing.  This  asserts  there  are  at  least  two  things 
that  have  property  The  second  sentence  says  for  any  x,  y,  z,  if  each  has  the  property  $>, 
that  at  least  one  of  them  is  the  same  as  the  other.  This  asserts  that  there  are  at  most  two 
things  that  have  property  $>. 

Combine  those  with  a  conjunction,  and  you  have  the  assertion  at  least  two  things  are  T . 
and  at  most  two  things  are  0.  That  is,  there  are  exactly  two  things  that  are  <t>. 
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(3x3y  ((<J>(x)A<J>(y))  A  -i. x=y)  A  VxVyVz  (((5>(x)A$(y))A$(z))  -»  ((x=y  v  x=z)  v 

Y=z))) 

That’s  awkward,  but  it  shows  that  we  can  express  any  particular  quantity  using  our  existing 
logical  language.  We  will  be  able  to  say,  for  example,  that  there  are  exactly  17  things  that 
are  $.  It  is  quite  surprising  to  think  that  we  do  not  need  numbers  to  be  able  to  express 
particular  finite  quantities,  and  that  our  logic  is  strong  enough  to  do  this. 

15.6  Problems 

1.  For  each  of  the  following,  describe  whether  the  relation  is  reflexive,  symmetric,  or 
transitive.  If  it  lacks  any  of  these  properties,  give  an  example  of  where  the  property 
would  fail  for  the  relation.  (That  is,  if  you  say  a  relation  is  not  symmetric,  for 
example,  then  give  an  example  of  an  exception.)  Assume  a  domain  of  discourse  of 
humans. 

a.  ...  is  the  sibling  of . . . 

b.  ...  is  the  mother  of . . . 

c.  ...  is  the  same  nationality  as  . . . 

d.  ...  is  older  than  . . . 

e.  ...  is  in  love  with  . . . 


2. 


a. 

b. 


c. 

d. 

e. 

f. 

g- 

h. 

i. 

j- 


3. 


a. 

b. 


c. 

d. 


e. 


Make  your  own  key  and  then  translate  the  following  into  our  logical  language.  You 
should  use  a  function  in  your  translation  of  each  sentence  that  has  an  implicit  function. 

The  mother  of  Ludwig  is  musical. 

Ludwig  is  not  musical. 

Ludwig’s  paternal  grandmother  is  musical,  but  his  maternal  grandmother  is  not. 
Ludwig’s  father  is  taller  than  his  mother. 

Ludwig’s  mother  is  not  taller  than  Ludwig,  or  his  father. 

Ludwig’s  mother  is  not  Ludwig’s  father. 

Someone  is  Ludwig’s  mother. 

Someone  is  someone’s  mother. 

Everyone  has  a  mother. 

No  one  is  the  mother  of  everyone. 


Provide  your  own  key  and  translate  the  following  into  our  logic.  This  will  require 
multiple  quantifiers. 


Everyone  is  friends  with  someone. 
Someone  is  friends  with  someone. 
Someone  is  friends  with  everyone. 
Everyone  is  friends  with  everyone. 
No  one  is  friends  with  everyone. 


4.  Use  Russell’s  interpretation  to  translate  the  following  expressions.  Make  your  own  key. 
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a.  The  Emperor  of  New  York  is  rotund. 

b.  The  Emperor  of  New  Jersey  is  not  rotund. 

c.  The  Emperor  of  New  York  is  not  the  Emperor  of  New  Jersey. 

d.  There  is  no  Emperor  of  New  York  but  there  is  an  Emperor  of  New  Jersey. 

e.  There  are  two  and  only  two  Emperors  of  New  York. 

5.  Use  our  logic  to  express  the  claim:  exactly  three  things  have  property  F. 


16.  Summary  of  first  order  logic 

16.1  Elements  of  the  language 

•  Symbolic  terms  are  either  names,  indefinite  names,  variables,  or  arbitrary  terms. 

—  Names:  a,  b,  c,  d,  e. . .. 

—  Indefinite  names:  p,  q,  r.... 

—  Variables:  x,y,z.... 

—  Arbitrary  terms:  x' ,  y',z' .... 

•  Each  predicate  has  an  arity,  which  is  the  number  of  symbolic  terms  required  by 
that  predicate  to  form  a  well-formed  formula.  The  predicates  of  our  language 
are:  F,  G,  H,  I.... 

•  Each  function  has  an  arity,  which  is  the  number  of  symbolic  terms  required  by 
the  function  in  order  for  it  to  form  a  symbolic  term.  The  functions  of  our 
language  are:  f,  g,  h,  i. . .. 

•  There  are  two  quantifiers. 

—  V,  the  universal  quantifier. 

—  3,  the  existential  quantifier. 

•  The  connectives  are  the  same  as  those  of  the  propositional  logic. 

16.2  Syntax  of  the  language 

•  An  arity  n  function  combined  with  n  symbolic  terms  is  a  symbolic  term. 

•  An  arity  n  predicate  combined  with  n  symbolic  terms  is  a  well-formed  formula. 

•  If  5>  and  W  are  well-formed  formulas,  then  the  following  are  also  well-formed 

formulas.  (And  if  $>  and  VF  are  sentences,  then  the  following  are  also  sentences.) 

- 

- 

-  ($A^) 

-  ($v^E) 

- 

•  We  write  $>(a)  to  mean  $  is  a  well-formed  formula  in  which  the  symbolic  term 
a  appears. 

•  If  there  are  no  quantifiers  in  $(x)  then  x  is  a  free  variable  in  $.  (Names  are  never 
described  as  being  free.)  If  for  that  formula  $  we  write  Vx$(x)  or  dxT(x),  we  say 
that  x  is  now  bound  in  0.  A  variable  that  is  bound  is  not  free. 

•  A  well-formed  formula  with  no  free  variables  is  a  sentence. 

16.3  Semantics  of  the  language 

•  The  semantics  of  names,  predicates,  and  the  quantifiers  will  remain  intuitive  for 
us.  Advanced  logic  (with  set  theory)  is  required  to  make  these  more  precise. 

We  say: 
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—  The  domain  of  discourse  is  the  collection  of  objects  that  our  language 
is  about. 

—  A  name  refers  to  exactly  one  object  from  our  domain  of  discourse. 

—  A  predicate  of  arity  n  describes  a  property  or  relation  of  n  objects. 

—  Vx$(x)  means  that  any  object  in  our  domain  of  discourse  has 
property  5>. 

—  3x5>(x)  means  that  at  least  one  object  in  our  domain  of  discourse  has 
property  $>. 

•  If  4>  and  VF  are  sentences,  then  the  meanings  of  the  connectives  are  fully  given 
by  their  truth  tables.  These  semantics-defining  truth  tables  are: 


<t> 

T 

F 

F 

T 

<t> 

W 

(<J>  -  W) 

T 

T 

T 

T 

F 

F 

F 

T 

T 

F 

F 

T 

<t> 

W 

($  A  \J/) 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

F 

W 

(tvf) 

T 

T 

T 

T 

F 

T 

F 

T 

T 

F 

F 

F 
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<t> 

W 

(<J>  « 

T 

T 

T 

T 

F 

F 

F 

T 

F 

F 

F 

T 

Each  sentence  must  be  true  or  false,  never  both,  never  neither. 

A  sentence  that  must  be  true  is  logically  true.  (Sentences  of  our  logic  that  have 
the  same  form  as  tautologies  of  the  propositional  logic  we  can  still  call 
“tautologies”.  However,  there  are  some  sentences  of  the  first  order  logic  that 
must  be  true  but  that  do  not  have  the  form  of  tautologies  of  the  propositional 
logic.  Examples  would  include  Vx(Fx  -»  Fx)  and  Vx(Fx  v  iFx).) 

A  sentence  that  must  be  false  is  a  contradictory  sentence. 

A  sentence  that  could  be  true  or  could  be  false  is  a  contingent  sentence. 

Two  sentences  $>  and  VF  are  “equivalent”  or  “logically  equivalent”  when 
is  a  theorem. 

1 6.4  Reasoning  with  the  Language 

An  argument  is  an  ordered  list  of  sentences,  one  sentence  of  which  we  call  the 
“conclusion”  and  the  others  of  which  we  call  the  “premises”. 

A  valid  argument  is  an  argument  in  which:  necessarily,  if  the  premises  are  true, 
then  the  conclusion  is  true. 

A  sound  argument  is  a  valid  argument  with  true  premises. 

Inference  rules  allow  us  to  write  down  a  sentence  that  must  be  true,  assuming 
that  certain  other  sentences  must  be  true.  We  say  that  the  sentence  is  derived 
from  those  other  sentences  using  the  inference  rule. 

Schematically,  we  can  write  out  the  inference  rules  in  the  following  way  (think 
of  these  as  saying,  if  you  have  written  down  the  sentence(s)  above  the  line,  then 
you  can  write  down  the  sentence  below  the  line;  also,  the  order  of  the  sentences 
above  the  line,  if  there  are  several,  does  not  matter): 
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Modus  ponens 

Modus  tollens 

Double  negation 

Double  negation 

($  -  vp) 

<t> 

($-+T) 

-AF 

<t> 

-1-1$ 

T 

-.$ 

-1-1$ 

$ 

Addition 

Addition 

Modus  tollendo 
ponens 

Modus  tollendo 
ponens 

<t> 

W 

o  o 

< 

($  v  XV) 
nW 

($vT) 

(tvW) 

T 

$ 

Adj  unction 

Simplification 

Simplification 

Bicondition 

<t> 

W 

($  A  T) 

($  A  VP) 

($  ->  T) 

(T->  $) 

(<t>  A  T) 

$ 

T 

($«q/) 

Equivalence 

Equivalence 

Equivalence 

Equivalence 

(4 »«.vp) 

<t> 

($  «  VF) 

W 

($  «  M/) 

-.$ 

($  «•  W)nW 

W 

$ 

iT 

-.$ 

Repeat 

Universal 

instantiation 

Existential 

generalization 

Existential 

instantiation 

<t> 

V«$(a) 

*(P) 

3a$(a) 

<t> 

*(P) 

where  |3  is  any 
symbolic  term 

3a$(a) 

where  (3  is  any 
symbolic  term 

*(x) 

where  y  is  an  indefinite 
name  that  does  not 
appear  above  in  any 
open  proof 

•  A  proof  (or  derivation)  is  a  syntactic  method  for  showing  an  argument  is  valid. 
Our  system  has  four  kinds  of  proof  (or  derivation):  direct,  conditional,  indirect, 
and  universal. 

•  A  direct  proof  (or  direct  derivation)  is  an  ordered  list  of  sentences  in  which 
every  sentence  is  either  a  premise  or  is  derived  from  earlier  lines  using  an 
inference  rule.  The  last  line  of  the  proof  is  the  conclusion. 

•  A  conditional  proof  (or  conditional  derivation)  is  an  ordered  list  of  sentences  in 
which  every  sentence  is  either  a  premise,  is  the  special  assumption  for 
conditional  derivation,  or  is  derived  from  earlier  lines  using  an  inference  rule.  If 
the  assumption  for  conditional  derivation  is  $,  and  we  derive  as  some  step  in  the 
proof  W,  then  we  can  write  after  this  ($^VT)  as  our  conclusion. 

•  An  indirect  proof  (or  indirect  derivation,  and  also  known  as  a  reductio  ad 
absurdum)  is:  an  ordered  list  of  sentences  in  which  every  sentence  is  either  l)  a 
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premise,  2)  the  special  assumption  for  indirect  derivation  (also  sometimes  called 
the  “assumption  for  reductio”),  or  3)  derived  from  earlier  lines  using  an 
inference  rule.  If  our  assumption  for  indirect  derivation  is  i$,  and  we  derive  as 
some  step  in  the  proof  'l7  and  also  as  some  step  of  our  proof  -iW,  then  we 
conclude  that  $>. 

•  A  universal  proof  (or  universal  derivation)  is  an  ordered  list  of  sentences  in 
which  every  sentence  is  either  a  premise  or  is  derived  from  earlier  lines  (not 
within  a  completed  subproof)  using  an  inference  rule.  If  we  are  able  to  prove 
$>(x')  where  x'  does  not  appear  free  in  any  line  above  the  universal  derivation, 
then  we  conclude  that  Vx$(x). 

•  The  schematic  form  of  the  direct,  conditional,  and  indirect  proof  methods 
remain  the  same  as  they  were  for  the  propositional  logic.  We  can  use  Fitch  bars 
to  write  out  this  fourth  proof  schema  in  the  following  way: 


I  <t>W 

\/a(f)(oi) 

•  A  sentence  that  we  can  prove  without  premises  is  a  theorem. 

1 6.5  Some  advice  on  translations  using  quantifiers 

Most  phrases  in  English  that  we  want  to  translate  into  our  first  order  logic  are  of  the 
following  forms. 


Everything  is  |t| 
Vx$(x) 

Something  is  $ 
3x$(x) 

Nothing  is  $ 
-i3x$(x) 

Something  is  not  $ 

3x-i<t>(x) 

All  <t>  are  M7 
Vx(4>(x)  ->  M7(x)) 

Some  $  are  M7 
3x(<fc(x)  A  M7(x)) 

No  $  are  M7 
-.3x(<t>(x)  A  W(x)) 

Some  $  are  not  M7 
3x($(x)  A  -ivI7(x)) 

Only  $  are  M7 
Vxfldx)  ->  •fix)) 

All  and  only  $  are  M7 
Vx(<t>(x)  M7(x)) 

Part  III:  A  Look  Forward 


17.  Some  advanced  topics  in  logic 

17.1  What  do  we  study  in  advanced  logic? 

Students  may  imagine  that  in  more  advanced  logic  we  continue  with  first  order  logic, 
translating  more  complex  sentences  and  using  them  in  proofs.  But,  in  advanced  logic,  we 
often  turn  toward  quite  different,  and  more  significant,  issues.  This  chapter  is  meant  as  a 
brief  look  forward.  It  will  give  you  a  sense  of  what  various  topics  in  advanced  logic  are 
like.  This  will  hopefully  encourage  you  to  continue  to  study  logic,  and  also,  perhaps,  this 
chapter  can  act  as  a  bridge  from  what  you  have  learned  to  what  can  come  next. 

This  chapter  will  provide  brief  illustrations  of  the  following  topics: 

17.2  Axiomatic  propositional  logic 

17.3  Mathematical  induction 

17.4  The  deduction  theorem  for  propositional  logic 

17.5  Set  theory 

17.6  Axiomatic  first  order  logic 

17.7  Modal  logic 

17.8  Peano  arithmetic 

17.2  Axiomatic  propositional  logic 

The  kinds  of  logical  systems  we  have  been  studying  up  to  now  are  called  “natural  deduction 
systems”.  In  a  natural  deduction  system,  we  add  rules  as  we  need  them,  and  we  make 
no  attempt  to  specify  the  language  in  a  compact  form.  As  logic  became  much  more 
sophisticated  in  the  twentieth  century,  logicians  and  mathematicians  began  to  study  what 
logic  could  do.  For  example,  are  all  logical  truths  provable?  To  answer  a  question  like  this, 
we  must  be  much  more  precise  about  the  nature  of  our  logic.  In  particular,  we  need  to 
boil  it  down  to  its  essential  elements.  Then  we  can  study  what  those  essential  elements  are 
capable  of  doing. 

We  call  this  “the  axiomatic  approach”.  It  is  ancient,  and  familiar  to  all  of  us  who  have 
studied  some  geometry  and  seen  Euclid’s  methods.  But,  in  the  last  century,  mathematicians 
developed  a  much  more  rigorous  understanding  of  the  axiomatic  approach  than  is  present 
in  Euclid’s  work.  In  this  section,  we  will  describe  an  axiom  system  for  the  propositional 
logic.  I  hope  that  you  are  pleasantly  shocked  to  find  how  compact  we  can  make  the 
propositional  logic. 

We  will  use  what  are  actually  called  “axiom  schemas”.  These  are  sentences  in  our 
metalanguage  that  describe  the  form  of  any  number  of  other  sentences.  Any  sentence  that 
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has  the  form  of  the  axiom  schema  is  an  instance  of  the  axiom.  Our  first  three  axiom  schemas 
are: 


(LI):  (*  -  (W  -  <D)) 

(L2):  ((«&  ->  (W-x))  -  ((d>  -  V)  -»  (<D  ->  x))) 

(L3):  ((-tf-»-.'P)-»((-.$-»'P)-*$)) 

We  suppose  that  we  have  some  number  of  atomic  sentences,  Pi,  P2,  P3  and  so  on  (it  is 
useful,  at  this  point,  to  use  a  single  letter,  with  subscripts,  so  we  do  not  find  ourselves 
running  out  of  letters).  We  have  one  rule:  modus  ponens.  We  have  one  proof  method: 
direct  derivation.  However,  we  have  now  a  new  principle  in  doing  proofs:  we  can  at  any 
time  assert  any  instance  of  an  axiom.  Thus,  each  line  of  our  direct  derivation  will  be  either 
a  premise,  an  instance  of  an  axiom,  or  derived  from  earlier  lines  using  modus  ponens. 

Later,  we  will  loosen  these  restrictions  on  direct  derivations  in  two  ways.  First,  we  will 
allow  ourselves  to  assert  theorems,  just  as  we  have  done  in  the  natural  deduction  system. 
Second,  we  will  allow  ourselves  to  apply  principles  that  we  have  proven  and  that  are 
general.  These  are  metatheorems:  theorems  proved  about  our  logic. 

The  semantics  for  this  system  are  like  those  for  the  propositional  logic:  we  assign  to 
every  atomic  formula  a  truth  value,  and  then  the  truth  value  of  the  sentences  built  using 
conditionals  and  negation  are  determined  by  the  truth  tables  for  those  connectives. 

Amazingly,  this  system  can  do  everything  that  our  propositional  logic  can  do. 
Furthermore,  because  it  is  so  small,  we  can  prove  things  about  this  logic  much  more  easily. 

What  about  conjunction,  disjunction,  and  the  biconditional?  We  introduce  these  using 
definitions.  Namely,  we  say,  whenever  we  write  “(<DAVP)”  we  mean  “n($->nW)”. 
Whenever  we  write  “(<DvvP)”  we  mean  “  Whenever  we  write  “(5*  **  VF)”  we 

mean  “(($-»vp)A (vp^$))”;  or,  in  full,  we  mean  “-.((fc-vp)— .(W-4>))”. 

We  will  introduce  a  useful  bit  of  metalanguage  at  this  point,  called  the  “turnstile”  and 
written  “I — ”.  We  will  write  “{vpi,vpj,...}|-<D”  as  a  way  of  saying  that  <D  is  provable  given 
the  assumptions  or  premises  {vFi,  Wj,  . . . }.  (Here  the  lower  case  letter  i  and  j  are  variables  or 
indefinite  names,  depending  on  context — so  “lP!”  means,  depending  on  the  context,  either 
any  sentence  or  some  specific  but  unidentified  sentence.  This  is  handy  in  our  metalanguage 
so  that  we  do  not  have  to  keep  introducing  new  Greek  letters  for  metalanguage  variables; 
it  is  useful  in  our  object  language  if  we  want  to  specify  arbitrary  or  unknown  atomic 
sentences,  P;,  Pj....)  The  brackets  and  “}”  are  used  to  indicate  a  set;  we  will  discuss 
sets  in  section  17.5,  but  for  now  just  think  of  this  is  a  collection  of  things  (in  this  case, 
sentences).  If  there  is  nothing  on  the  left  of  the  turnstile — which  we  can  write  like  this, 
“{  }l —  <D”  — then  we  know  that  $  is  a  theorem. 

Proving  theorems  using  an  axiomatic  system  is  often  more  challenging  than  proving  the 
same  theorem  in  the  natural  deduction  system.  This  is  because  you  have  fewer  resources, 
and  so  it  often  seems  to  require  some  cleverness  to  prove  a  theorem.  For  this  reason,  we 
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tend  to  continue  to  use  the  methods  and  rules  of  a  natural  deduction  system  when  we  aim 
to  apply  our  logic;  the  primary  benefit  of  an  axiom  system  is  to  allow  us  to  study  our  logic. 
This  logical  study  of  logic  is  sometimes  called  “metalogic”. 

An  example  can  be  illustrative.  It  would  be  trivial  in  our  natural  deduction  system  to  prove 
(P-P).  One  proof  would  be: 


1.  P  assumption  for  conditional  derivation 

2.  p  repeat,  1 

3.  (P  — >  P)  conditional  derivation,  1-2 

To  prove  the  equivalent  in  the  axiom  system  is  more  difficult.  We  will  prove  (P,— P,). 
That  is,  we  will  prove  {}l — (P,— P,).  What  we  must  do  in  this  system  is  find  instances  of 
our  axioms  that  we  can  use  to  show,  via  modus  ponens,  our  conclusion.  We  will  not  be 
using  the  Fitch  bars.  The  proof  will  begin  with: 

i.  ((pHfr-  Pd  -»p,))  -  ((pHp^pO)  -(p^Pi)))  instance  of  (L2) 


To  see  how  why  we  are  permitted  to  assert  this  sentence,  remember  that  we  are  allowed 
in  this  system  to  assert  at  any  time  either  a  premise  or  an  axiom.  We  are  trying  to  prove 
a  theorem,  and  so  we  have  no  premises.  Thus,  each  line  will  be  either  an  instance  of  an 
axiom,  or  will  be  derived  from  earlier  lines  using  modus  ponens.  How  then  is  line  1  an 
instance  of  an  axiom? 

Recall  that  axiom  (L2)  is:  (($>  -»  (R7— y))  (($  R7)  ->  ($  ->  y)).  We  replace  $  with  Plt 
and  we  get  ((Pi  -»  (T7— y))  -»  ((Pi  -»  R7)  ->  (Pt  -»  y)).  We  replace  R7  with  (Pi->  Pt)  and  we 
have  ((Pi  ->  ((Pi-»  Pi)  — y))  -  ((Pi  -  (Pi-  Pi))  -  (Pi  -  y)).  Finally,  we  replace  y  with  P1; 
and  we  end  up  with  the  instance  that  is  line  1. 

Continuing  the  proof,  we  have: 


1.  ((Pl-((Pl-  Pi)  -Pi))  -((Pl-(P!-P,))  -(P,-Pl))) 

instance  of  (L2) 

2.  (Pi-((Pr*  Pi)  -P,)) 

instance  of  (Ll) 

3.  ((PHP^Pi))  -(P,-Pi)) 

modus  ponens  1,  2 

4.  (Pi-lPj-Pd) 

instance  of  (Ll) 

5.  (Pi-Pi) 

modus  ponens  3,  4 

That  proof  is  a  bit  longer,  and  less  intuitive,  than  our  natural  deduction  proof.  Nonetheless, 
it  does  illustrate  that  we  can  prove  (Pi-  Pi)  in  this  system. 

We  can  recreate  this  proof  at  the  level  of  our  metalanguage,  and  show  that  ($— $>).  That 
would  simply  mean  doing  the  same  proof,  where  every  Pt  is  replaced  by  $>. 
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With  the  axiomatic  propositional  logic,  it  is  not  difficult  to  prove  two  important  results 
about  the  logic.  First,  others  have  proven  that  this  propositional  logic  is  complete. 
Intuitively,  a  logical  system  is  complete  when  all  of  the  truths  of  that  system  are  provable. 
In  practice,  we  must  carefully  define  for  each  particular  logical  system  what  completeness 
will  mean.  For  the  propositional  logic,  “completeness”  means  that  all  the  tautologies  are 
theorems.  That  is,  every  sentence  that  must  be  true  is  provable.  This  is  in  fact  the  case. 
Intuitively,  a  logical  system  is  consistent  if  it  is  not  possible  to  prove  some  sentence  and  also 
prove  its  denial.  The  propositional  logic  is  also  consistent. 

Unfortunately,  these  proofs  of  completeness  and  consistency  are  a  bit  longer  than  we  want 
to  include  in  this  chapter’s  brief  overview.1  However,  to  give  a  sense  of  how  one  goes 
about  proving  things  about  the  axiomatic  propositional  logic,  we  can  prove  another  result, 
the  deduction  theorem.  It  is  helpful  to  quickly  add  to  the  axiomatic  system  additional 
results  that  simplify  proofs.  The  most  useful  of  these  is  the  deduction  theorem;  but  before 
we  can  prove  the  deduction  theorem,  we  need  to  introduce  a  new  proof  method.  We  do 
this  in  the  next  section;  and  prove  the  deduction  theorem  in  the  section  after  that. 

17.3  Mathematical  induction 

In  mathematics,  and  in  advanced  logic,  we  often  make  use  of  a  special  form  of  reasoning 
called  “mathematical  induction”.  (This  name  is  unfortunate,  since  “induction”  is  also  the 
name  of  a  reasoning  method  that  is  not  a  deductive  method.  However,  mathematical 
induction  is  a  deductive  method.)  The  idea  of  mathematical  induction  is  very  powerful, 
and  proves  to  be  one  of  the  most  useful  methods  for  proving  things  about  our  logic.  It  can 
be  used  whenever  the  collection  of  things  that  we  aim  to  prove  something  about  is  ordered 
in  the  right  way.  An  example  of  a  set  that  is  ordered  the  right  way  is  the  natural  numbers, 
where  there  is  a  least  number  (l),  and  each  number  after  1  has  a  unique  predecessor. 

Here  is  the  method.  Suppose  we  want  to  prove  that  a  property  $  is  had  by  every  member 
of  some  collection  of  objects  that  can  be  ordered  in  the  right  way. 

First,  we  prove  the  base  case.  This  means  that  we  find  the  least  element(s)  and  prove  that  it 
has  (or  they  have)  the  property  in  question. 

Second,  we  make  the  induction  hypothesis.  We  assume  that  some  arbitrary  object  in  our 
collection — the  nth  objection  in  our  ordered  collection — has  the  property  4>. 

Third,  we  prove  the  induction  step.  We  show  that  if  the  arbitrary  nrh  object  in  our  ordering 
of  the  objects  in  our  domain  has  property  3>,  then  the  next  object  (the  n+lth  object)  in  our 
ordering  of  the  objects  in  our  domain  has  property  4>. 

We  are  then  done:  we  conclude  that  every  object  in  our  domain  has  the  property  $>. 

An  alternative  way  of  formatting  a  proof  using  mathematical  induction  uses  a  different 
formulation  of  the  induction  hypothesis.  We  pick  an  arbitrary  object  in  our  ordering,  the 

nth  item.  Then,  we  assume  that  every  object  in  our  ordering  before  the  nth  item  has  the 
property  5>.  For  the  induction  step,  we  prove  that  if  each  object  in  the  ordering  before  the 
nth  object  has  the  property  4>,  then  the  nth  item  has  the  property  4>.  (This  is  sometimes  called 
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“strong  mathematical  induction”,  and  then  the  method  above  is  called  “weak  mathematical 
induction”.  Some  logicians  insist  that  the  two  methods  are  identical  and  so  they  do  not 
appreciate  the  difference  in  terms.) 

The  reasoning  here  is  hopefully  intuitive.  We  show  the  first  item  has  the  property,  and  that 
if  any  object  has  the  property  then  the  next  object  does.  In  that  case,  every  object  must  have 
the  property,  because,  like  falling  dominoes,  we  know  the  first  element  has  the  property, 
but  then  the  second  element  does,  but  then  the  third  element  does,  and  so  on. 

It  will  be  helpful  to  give  an  example  of  mathematical  induction  that  backs  away  from 
our  logic  and  assumes  some  very  basic  mathematics,  since  many  people  are  familiar  with 
arithmetic  from  years  of  schooling.  Legend  has  it  that  as  a  young  child  the  mathematician 
Carl  Friedrich  Gauss  (1777-1855)  was  instructed  by  a  teacher — seeking  to  occupy  the  boy 
in  order  to  shut  him  up  for  a  while — to  add  up  all  the  numbers  to  100.  Gauss  very  quickly 
gave  the  teacher  the  answer.  The  legend  is  that  he  promptly  saw  a  pattern  in  adding  a 

sequence  of  numbers,  and  derived  the  following  result:  1  +  2  +  3+  ...  +  n  =  (n2+n)  12. 
Let’s  prove  this  using  mathematical  induction. 

First,  we  prove  the  base  case.  The  sum  of  1  is  1.  This  we  see  from  our  own  reasoning. 
The  equation  is  (n2+n)  /2,  and  so  we  have  (l2+l)  /2  =  (l+l)  /2,  and  (l+l)/2  =  2/2,  and  2/ 
2  =  1.  Thus,  the  equation  works  for  the  base  case:  it  matches  our  independent  reasoning 
about  the  base  case. 

Second,  we  make  the  induction  hypothesis.  We  assume  for  some  arbitrary  n  that  the  sum 
of  numbers  up  to  the  nth  number  is  (n2+n)  12. 

Third,  we  prove  the  induction  step.  The  next  step  after  n  is  n+1.  Since  we  assume  in  the 
induction  hypothesis  that  the  sum  of  numbers  up  to  nth  number  is  (n2+n)  /2,  the  sum  up  to 
the  n+lth  number  is:  ((n2+n)/2)  +  (n+l). 

So  our  task  is  to  show  that  this  is  equivalent  to  the  proposed  theorem,  for  the  n+lth  case. 
Substitute  in  (n+l)  for  n  in  the  equation  (n2+n)  12.  That  gives  us:  ((n+l)2+  n+l)  12. 

If  we  can  show  that  ((n2+n)/2)  +  (n+l)  =  ((n+l)2+  n+l)  /2,  we  will  have  proven  the 
induction  step.  Note  that  the  left  hand  side  we  got  from  our  induction  step  plus  a  simple 
observation.  The  right  hand  side  we  got  from  applying  the  equation.  Our  question  then 
is,  are  the  results  the  same?  That  is,  does  the  equation  match  what  we  showed  to  be  true? 

Consider  the  left  side.  We  see: 

((n2+n)/2)  +  (n+l)  =  ((n2+n)/2)  +  (2n+2)/2) 


and 


((n2+n)/2)  +  (2n+2)/2)  =  ((n2+n)  +  (2n+2))/2 
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and 


((n2+n)  +  (2n+2))/2  =  (n2+  3n  +  2)12 

Consider  now  the  right  side.  Here,  we  are  applying  the  equation.  We  are  hoping  it  will 
come  out  the  same  as  our  independent  reasoning  above.  We  see: 

((n+l)2+  n+l)  12  =  ((n“  +  2n  +  l)  +  n  +  l)/2 


and 


((n2  +  2n  +  l)  +  n  +  l)/2  =  (n2  +  3n  +  2)  12 

So,  since  the  two  are  identical  (that  is,  our  reasoning  based  on  induction  and  basic 
observations  matched  what  the  equation  provides),  we  have  proven  the  equation  applies  in 

the  n+lth  case,  and  so  we  have  proven  the  induction  step. 

Using  mathematical  induction  we  now  conclude  that  the  sum  of  numbers  up  to  the 
nth  number,  for  any  n,  is  always  (n2+n)/2. 

17.4  The  deduction  theorem 

How  can  mathematical  induction  be  useful  for  something  like  studying  our  logic?  Here 
is  a  practical  case.  In  chapter  6,  we  introduced  conditional  derivations.  We  reasoned  that 
the  proof  method  looked  obviously  correct,  and  then  we  continued  using  it  from  that  point 
on.  Instead,  using  mathematical  induction,  we  will  now  prove  that  conditional  derivation 
works.  We  will  do  this  by  proving  a  theorem  about  our  logic:  the  deduction  theorem. 

The  deduction  theorem  states  that,  in  our  propositional  logic,  if  with  some  premises, 
including  $>,  we  can  prove  W,  then  ($>^VI/).  This  is  precisely  what  we  want  from 
conditional  derivation.  We  are  saying  that  if  we  have  $  as  a  premise,  and  we  are  then  able 
to  prove  W,  then  we  can  assert  the  conditional  (^^T). 

To  prove  this,  we  will  use  mathematical  induction.  At  this  point,  students  will  no  doubt 
notice  a  significant  change  in  our  approach  to  logic.  As  logic  becomes  more  advanced,  we 
tend  to  more  frequently  perform  proofs  that  are  stated  partly  in  English,  and  that  are  given 
in  what  seems  a  more  informal  manner.  The  proofs  are  in  fact  not  less  formal,  but  they 
are  being  done  in  the  metalanguage,  supplemented  with  our  natural  language.  This  may  at 
first  seem  strange,  because  by  working  in  an  axiom  system  we  are  also  making  some  aspects 
of  our  system  more  formal.  But  the  axiomatic  approach  is  there  to  demonstrate  certain 
features  of  our  language.  Having  convinced  ourselves  of  the  basics,  we  will  allow  ourselves 
to  leap  ahead  often  now  in  our  explanation  of  proofs. 

Let  S  be  a  set  of  sentences  of  our  logical  language.  S  can  be  the  empty  set.  These  are 
(all  but  one  of)  our  premises.  We  want  to  single  out  a  particular  premise,  call  it  4>.  The 
deduction  theorem  says  that  if  S  U  {$}l —  W  then  S  I —  (5>^VE).  The  expression  “S  u{$}” 
just  means  the  set  that  collects  together  both  all  the  things  in  S  and  also  all  the  things  in 
{$>}.  Or,  put  informally:  it  means  take  the  set  S  and  add  to  it  also  the  sentence  4>. 
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What  will  be  our  ordered  list  of  things  upon  which  we  perform  mathematical  induction? 
A  proof,  recall,  is  an  ordered  list  of  sentences,  the  last  of  which  is  our  conclusion.  So  we 
can  do  induction  upon  a  proof.  We  let  $!,  $*3. .  .3L  be  the  list  of  sentences  in  our  proof. 

Since  this  is  a  proof  of  'l7  and  <!>„  is  our  last  step  in  the  proof,  5>n  is  our  conclusion  We 
prove  the  theorem  by  considering  an  arbitrary  proof  of  M7  that  assumes  S  U  {<!>}. 

First,  we  consider  the  first  line  of  our  proof.  5L  must  be  either  (a)  an  axiom,  (b)  one  of  the 
sentences  in  the  set  of  premises  S,  (c)  or  it  must  be  $.  Axiom  (Ll)  tells  us  that  ($>  ->  (M7  -> 
*)).  and  an  instance  of  this  is  (*i  -  (<*>  -  $1)).  Thus,  with  this  instance  of  (Ll),  and  modus 
ponens,  we  could  derive  (<I>  ->  $>,).  This  tells  us  that  if  either  (a)  $1  is  an  axiom  or  (b)  it  is  a 
premise,  we  can  prove  (<t>  -*■  $1).  In  case  (c),  where  the  first  line  of  the  proof  is  <t>,  we  can 
provide  a  proof  like  the  one  above  (in  section  17.2)  to  show  that  (<t>  -»  $>).  Thus,  in  each 
case,  if  we  can  have  $1  as  a  first  step  of  our  proof,  we  know  that  we  could  also  prove  that 
($  ->  $j).  The  deduction  theorem  is  true  of  our  base  case  (the  first  step  of  the  proof). 

Now  we  make  the  inductive  assumption.  Let  j  be  any  step  in  our  proof.  We  assume  that 
for  any  sentence  in  our  ordering  before  step  j,  the  deduction  theorem  applies.  That  is,  for 
any  i<j,  S  I — 

We  now  prove  that  for  the  jth  step  of  our  proof,  the  deduction  theorem  applies.  The  next 
step  in  the  proof  is  line  ?>j.  This  step  j  is  either  (a)  an  axiom,  (b)  one  of  the  sentences  in  the 
set  of  premises  S,  (c)  it  is  $,  or  (d)  it  follows  from  earlier  steps  in  the  proof  and  <P|,  with 
modus  ponens.  In  handling  the  base  case,  we  have  already  shown  how  cases  (a),  (b),  and  (c) 
can  be  used  to  show  that  the  deduction  theorem  is  true  of  these  cases.  The  same  arguments 
would  apply  for  step  j  of  the  proof.  We  need  only  consider  the  additional  case  of  (d). 

We  consider  then  the  case  where  follows  from  earlier  steps  in  the  proof  $g  and  <P|,.  That 
means  that  one  of  these  (either  $>g  or  <ty)  is  a  conditional,  and  the  other  is  the  antecedent  to 
the  conditional.  Our  induction  hypothesis  says  that  for  any  <J>;,  where  i<j,  S  I —  (<$^<I>i).  So, 
that  means  for  g  and  h,  because  g<j  and  h<j,  ($>^<I>S)  and  also  ($>  -»  <$!,).  We  have  noted 
that  one  of  these  formulas  must  be  a  conditional;  suppose  the  conditional  is  <$h-  Thus,  <P|, 
has  the  form  So  we  have  by  the  induction  hypothesis  that  (<$^<t>„)  and  also  ($> 

-  (*r*j)).  But  we  have  as  an  instance  of  axiom  (L2)  that  (($>  -»  -»  (($^$g)  -> 

($>^<I>j))).  Repeated  applications  of  modus  ponens  would  give  us  ($>^<I>j). 

This  applies  to  any  arbitrary  step  of  our  proof,  and  so  it  applies  when  j=n.  That  is,  it  applies 
to  the  last  step  of  the  proof.  We  have  shown  that  (<f>^<t>II).  Since  this  was  a  proof  of  ML 
<t>„  is  ML  We  have  thus  shown  that  if  S  u{*}l-  M7  then  S  I —  (tf^M7). 

This  proof  was  made  simple  by  the  fact  that  we  had  very  few  cases  to  consider:  each  line 
of  the  proof  is  either  a  premise,  an  axiom,  or  derived  from  earlier  lines  using  a  single  rule, 
modus  ponens.  By  reducing  our  system  to  its  minimal  elements,  an  axiom  system  allows  us 
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to  prove  results  like  these  much  more  briefly  than  would  be  possible  if  we  had  many  more 
cases  to  consider. 

Although  it  adds  no  power  to  the  system  (anything  that  can  be  proved  using  the  deduction 
theorem  can  be  proved  in  the  system  without  it),  the  deduction  theorem  makes  many 
proofs  smaller.  For  example,  we  can  quickly  prove  other  principles,  such  as  the  kind  of 
inference  traditionally  called  the  “chain  rule”.  We  can  call  this  the  “chain  rule  theorem”. 


1.  (P1-P2) 

assumption 

2.  (P2-P3) 

assumption 

3.  Pi 

assumption 

4-  P2 

modus  ponens  1,  3 

5.  P3 

modus  ponens  2,  4 

Which  is  sufficient  to  show  that  {(Pi-P2),  (P2-P3),  P,}  I —  P3,  and,  therefore,  by  the 
deduction  theorem  {(Pi^P2),  (P2^P3)}  I —  (Pi~*P3).  Using  the  axioms  alone,  this  proof 
would  require  many  steps. 


17.5  Set  theory 

One  of  the  most  useful  tools  of  logic  and  mathematics  is  set  theory.  Using  very  simple 
principles,  set  theory  allows  us  to  construct  some  of  the  most  complex  elements  of  logic 
and  mathematics.  This  allows  for  a  pleasing  elegance:  it  seems  we  can  understand  and 
construct  many  phenomena  from  very  intuitive  and  basic  assumptions. 

Set  theory  was  first  developed  by  the  mathematician  Georg  Cantor  (1845-1918).  The 
fundamental  idea  is  that  of  a  set.  A  set  is  a  collection  of  things,  and  the  identity  of  a  set  is 
determined  by  its  elements.  A  set  is  not  identical  to  its  elements,  however.  There  is,  for 
example,  an  empty  set:  a  set  that  contains  nothing.  It  is  however  something:  it  is  the  set 
with  no  members. 

With  the  notion  of  a  set,  some  basic  ideas  about  set  formation,  and  our  first  order  logic, 
we  have  a  powerful  theory  that  is  useful  throughout  logic,  and  allows  us  to  do  additional 
things  such  as  construct  the  numbers  and  study  infinity.  In  this  section,  we  can  review 
some  concepts  of  basic  intuitive  set  theory,  which  we  will  call  “natural  set  theory”. 

The  idea  of  a  set  is  that  it  is  a  collection  of  things.  Once  we  pick  our  domain  of  discourse, 
we  can  describe  collections  of  things  from  that  domain  of  discourse  as  sets.  We  write  these 
sets  using  braces.  So,  if  our  domain  of  discourse  were  the  natural  numbers,  the  following 
would  be  a  set: 

{1,2,3} 

A  set  is  determined  by  its  members  (also  sometimes  called  “elements”),  but  it  is  not  the  same 
thing  as  its  members:  1  is  not  the  same  thing  as  {l}.  And,  we  assume  there  is  an  empty 
set,  the  set  of  nothing.  We  can  write  this  as  {}  or  as  0.  Sets  can  contain  other  sets:  the 
following  is  a  set  containing  three  different  members. 
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{{},{{}},«{}}}} 

This  is  also  an  example  of  a  pure  set:  it  contains  nothing  but  sets.  If  we  develop  our  set 
theory  with  sets  alone  (if  our  domain  of  discourse  is  only  sets)  and  without  any  other  kinds 
of  elements,  we  call  it  “pure  set  theory”. 

The  members  of  sets  are  not  ordered.  Thus 

{1,2,3}  =  {3,  2,1} 

But  an  ordered  set  is  a  set  in  which  the  order  does  matter.  We  can  indicate  an  ordered  set 
using  angle  brackets,  instead  of  curly  brackets.  Thus: 

<1,  2,  3>  =  <1,  2,  3> 


but 


<1,  2,  3>  *  <3,  2,  1> 

We  will  write  { . . . }  for  a  set  when  we  want  to  show  its  contents,  and  A,  B,  ...  for  sets  when 
we  are  dealing  with  them  more  generally.  We  write  xGA  to  mean  that  x  is  a  member  of 
the  set  A.  As  noted,  sets  can  be  members  of  sets. 

Sets  are  defined  by  their  contents,  so  two  sets  are  the  same  set  if  they  have  the  same  contents. 
A  =  B  if  and  only  if  Vx(xGA  xGB) 

This  is  interesting  because  it  can  be  a  definition  of  identity  in  set  theory.  In  the  natural 
deduction  system  first  order  logic,  we  needed  to  take  identity  as  a  primitive.  Here,  we  have 
instead  defined  it  using  membership  and  our  first  order  logic. 

If  all  the  contents  of  a  set  A  are  in  another  set  B,  we  say  A  is  a  subset  of  B. 

A  c  B  if  and  only  if  Vx(xGA  -»  xGB) 

A  proper  subset  is  a  subset  that  is  not  identical  (that  means  B  has  something  not  in  A,  in  the 
following  case): 

A  c  B  if  and  only  if  (A  C  B  A  A*B) 

The  empty  set  is  a  subset  of  every  set. 

The  power  set  operation  gives  us  the  set  of  all  subsets  of  a  set. 

$3  (A)  =  B  if  and  only  if  Vx(x  C  A  -»  xGB) 

There  are  always  2n  members  in  the  power  set  of  a  set  with  n  members.  That  is,  if  A  has  n 
members,  then  g>(A)  has  2n  members. 
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The  cardinal  size  of  a  set  is  determined  by  finding  a  one-to-one  correspondence  with  the 
members  of  the  set.  Two  sets  have  the  same  cardinal  size  (we  say,  they  have  the  same 
cardinality)  if  there  is  some  way  to  show  there  exists  a  one-to-one  correspondence  between 
all  the  members  of  one  set  and  all  the  members  of  the  other.  For  the  cardinality  of  some  set 
A,  we  can  write 

lAl 

There  is  a  one-to-one  correspondence  to  be  found  between  all  the  members  of  A  and  all 
the  members  of  B,  if  and  only  if 

lAl  =  iBl 

If  A  c  B  then  lAl  <  iBl. 

The  union  of  two  sets  is  a  set  that  contains  every  member  of  either  set. 

AUB  is  defined  as  satisfying  Vx((xGA  v  x  GB)  -»  x  G  AUB) 

The  intersection  of  two  sets  is  a  set  that  contains  every  member  that  is  in  both  the  sets. 

AnB  is  defined  as  satisfying  Vx((xGA  A  x  GB)  -»  x  G  AnB) 

A  shorthand  way  to  describe  a  set  is  to  write  the  following: 

{  x  I  $(x)} 

This  is  the  set  of  all  those  things  x  such  that  x  has  property  $>.  So  for  example  if  our  domain 
of  discourse  were  natural  numbers,  then  the  set  of  all  numbers  greater  than  100  could  be 
written:  {  x  I  x  >  100  }. 

A  relation  is  a  set  of  ordered  sets  of  more  than  one  element.  For  example,  a  binary  relation 
meant  to  represent  squaring  might  include  {...  <9,  81>,  <10,  100>  ...};  a  trinary  relation 
meant  to  represent  factors  might  include  { . . .  <9,  3,  3>,  <10,  2,  5>  . . . };  and  so  on. 

One  useful  kind  of  relation  is  a  product.  The  product  of  two  sets  is  a  set  of  all  the  ordered 
pairs  taking  a  member  from  the  first  set  and  a  member  from  the  second. 

A  x  B  is  defined  as  satisfying  VxVy((xGA  A  yGB)  <-»  <x,  y>  G  A  x  B) 

Many  of  us  are  familiar  with  the  either  of  the  Cartesian  product,  which  forms  the  Cartesian 
plane.  The  x  axis  is  the  set  of  real  numbers  R,  and  the  y  axis  is  the  set  of  real  numbers  R. 
The  Cartesian  product  is  the  set  of  ordered  pairs  RxR.  Each  such  pair  we  write  in  the 
form  <x,  y>.  These  form  a  plane,  and  we  can  identify  any  point  in  this  plane  using  these 
“Cartesian  coordinates”. 

Another  useful  kind  of  relation  is  a  function.  A  function  f  is  a  set  that  is  a  relation  between 
the  members  of  two  sets.  One  set  is  called  the  “domain”,  and  the  other  is  called  the  “range”. 
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Suppose  A  is  the  domain  and  B  is  the  range  of  a  function,  then  (if  we  let  a  be  a  member  of 
A  and  b  and  c  be  members  of  B,  so  by  writing  fab  I  mean  that  function  f  relates  a  from  its 
domain  to  b  in  its  range): 

Iff  is  a  function  from  A  into  B,  then  if  fab  and  fac  then  b=c 

This  captures  the  idea  that  for  each  “input”  (item  in  its  domain)  the  function  has  one 
“output”  (a  single  corresponding  item  in  its  range).  We  also  say  a  function  f  is 

•  from  a  set  A  if  its  domain  is  a  subset  of  A 

•  on  a  set  A  if  its  domain  is  A 

•  into  a  set  B  if  its  range  is  a  subset  of  B 

•  onto  a  set  B  if  its  range  is  B 

If  a  function  f  is  from  A  and  into  B,  and  also  f  is  such  that 
If  fab  and  fcb  then  a=c 

then  fis  a  1-to-l  function  from  A  and  into  B.  The  idea  of  being  1-to-l  is  that  fis  a 
function  that  if  reversed  would  be  a  function  also.  As  we  noted  above,  if  there  is  a  1-to-l 
function  that  is  on  A  and  onto  B,  then  lAl  =  iBl. 

If  a  function  f  is  1-to-l  on  A  and  is  into  B,  then  we  know  that  IBl  >  lAl.  (Such  a  function 
has  every  member  of  A  in  its  domain,  and  for  each  such  member  picks  out  exactly  one 
member  of  B;  but  because  we  only  know  that  the  function  is  into  B,  we  do  not  know 

whether  there  are  members  of  B  that  are  not  in  the  range  of  the  function,  and  we  cannot 

be  sure  that  there  is  some  other  1-to-l  function  on  A  and  onto  B.) 

A  common  notation  also  is  to  write  f(a)  for  the  function  fwith  a  from  its  domain.  So, 
when  we  write  this,  we  identify  the  expression  with  the  element  from  the  range  that  the 
function  relates  to  a.  That  is,  f(a)  =  b. 

We  can  prove  many  interesting  things  using  this  natural  set  theory.  For  example,  Cantor 
was  able  to  offer  us  the  following  proof  that  for  any  set  S,  l^(S)l  >  ISl.  That  is,  the 
cardinality  of  the  power  set  of  a  set  is  always  greater  than  the  cardinality  of  that  set. 

We  prove  the  claim  by  indirect  proof.  We  aim  to  show  that  lp(S)l  >  ISl,  so  we  assume  for 
reductio  that  l^>(S)l  <  ISl.  We  note  that  there  is  a  function  on  S  and  into  g>(S);  this  is  the 
function  that  takes  each  member  of  S  as  its  domain,  and  assigns  to  that  member  the  set  of 
just  that  element  in  p(S).  So,  for  example,  if  a€S  then  {a}Gp(S);  and  there  is  function  on 
S  and  into  p(S)  that  assigns  a  to  {a},bto  {b},  and  so  on. 

Since  there  is  such  a  function,  lp(S)l  >  ISl.  But  if  lp(S)l  <  ISl  and  lp(S)l  >  ISl,  then  l^(S)l  =  ISl. 
Therefore,  there  is  a  one-to-one  function  on  S  and  onto  g>(S).  Let  f  be  one  such  function. 

Consider  that  each  object  in  S  will  be  in  the  domain  of  f,  and  be  related  by  f  to  a  set  in 
p(S).  It  follows  that  each  element  of  S  must  be  related  to  a  set  that  either  does,  or  does 
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not,  contain  that  element.  In  other  words,  for  each  sGS,  f(s)  is  some  set  AG§>(S),  and  either 
sGA  or  -isGA.  Consider  now  the  set  of  all  the  objects  in  the  domain  of  f  (that  is,  all  those 
objects  in  S)  that  f  relates  to  a  set  in  p(S)  that  does  not  contain  that  element.  More  formally, 
this  means:  consider  the  set  C  (for  crazy)  where  C={  s  I  sGS  A  -is  G  f(s)}.  This  set  must  be 
in  p(S),  because  every  possible  combination  of  the  elements  of  S  is  in  ^(S) — including  the 
empty  set  and  including  S.  But  now,  what  object  in  the  domain  of  f  is  related  to  this  set? 
We  suppose  that  f  is  a  1-to-l  function  on  S  and  onto  p(S),  so  some  element  of  S  must  be 
related  by  f  to  C,  if  f  exists.  Call  this  element  c;  that  is,  suppose  f(c)=C. 

Is  cGC?  If  the  answer  is  yes,  then  it  cannot  be  that  f(c)=C,  since  by  definition  C  is  all  those 
elements  of  S  that  f  relates  to  sets  not  containing  those  elements.  So  -ic  GC.  But  then,  C 
should  contain  c,  because  C  contains  all  those  elements  of  S  that  are  related  by  f  to  sets  that 
do  not  contain  that  element.  So  cGC  and  -ic  GC.  We  have  a  contradiction.  We  conclude 
that  the  source  of  the  contradiction  was  the  assumption  for  reductio,  l^(S)l  <  ISl.  Thus,  for 
any  set  S,  l^(S)l  >  ISl. 

This  result  is  sometimes  called  “Cantor’s  theorem”.  It  has  interesting  consequences, 
including  that  there  cannot  be  a  largest  set:  every  set  has  a  powerset  that  is  larger  than  it. 
This  includes  even  infinite  sets;  Cantor’s  theorem  shows  us  that  there  are  different  sized 
infinities,  some  larger  than  others;  and,  there  is  no  largest  infinity. 

1 7.6  Axiomatic  first  order  logic 

In  this  section  we  briefly  describe  how  the  axiomatic  approach  can  be  taken  toward  the  first 
order  logic,  and  how  a  semantics  can  be  constructed.  For  simplicity,  we  shall  assume  our 
example  language  does  not  include  functions. 

The  basic  elements  of  our  language  are  connectives  “i”;  predicates  F;(...),  each  with 
a  specific  arity;  the  quantifier  V;  and  two  kinds  of  symbolic  terms:  variables  xi...Xi...  and 
names 

Our  syntax  is  similar  to  the  syntax  described  in  13.2.  Each  predicate  of  arity  n  followed  by 
n  symbolic  terms  is  well  formed.  If  5>  and  VF  are  well  formed,  then  -T  and  (<f>^'T)  are  well 
formed.  Every  well-formed  formula  preceded  by  Vx;  is  well  formed. 

A  typical  axiom  system  for  first  order  logic  takes  the  axioms  of  propositional  logic  and 
strengthens  them  with  two  additional  axioms  and  an  additional  rule. 

(L4)  (Vx,$(x,)  -  $(a)) 

(L5)  (Vx,($  -  W)  -  (4>  -Vx,T))  if  x;  is  not  free  in  $>. 

Where  a  is  any  symbolic  term. 

The  additional  rule  is  generalization.  From  I —  $>  we  can  conclude  I —  Vx,4>(x;).  This  plays 
the  role  of  universal  derivation  in  our  natural  deduction  system  for  the  first  order  logic. 
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This  compact  system,  only  slightly  larger  than  the  axiomatic  propositional  logic,  is  as 
powerful  as  our  natural  deduction  system  first  order  logic. 

This  is  a  convenient  place  to  describe,  in  a  preliminary  and  general  way,  how  we  can 
conceive  of  a  formal  semantics  for  the  first  order  logic.  When  we  introduced  the  first 
order  logic,  we  kept  the  semantics  intuitive.  We  can  now  describe  how  we  could  start  to 
develop  a  formal  semantics  for  the  language.  The  approach  here  is  one  first  developed  by 
Alfred  Tarski  (1901-1981). [14]  Tarski  introduces  a  separate  concept  of  satisfaction,  which  he 
then  uses  to  define  truth,  but  we  will  cover  over  those  details  just  to  illustrate  the  concept 
underlying  a  model. 

The  approach  is  to  assign  elements  of  our  language  to  particular  kinds  of  formal  objects. 
We  group  these  all  together  into  a  model.  Thus,  a  model  M  is  an  ordered  set  that  contains 
a  number  of  things.  First,  it  contains  a  set  D  of  the  things  that  our  language  is  about — our 
domain  of  discourse.  Second,  the  model  includes  our  interpretation,  I,  which  contains 
functions  for  the  elements  of  our  language.  For  each  name,  there  is  a  function  that  relates 
the  name  to  one  object  in  our  domain  of  discourse.  For  example,  suppose  our  domain  of 
discourse  is  natural  numbers.  Then  a  name  in  our  language  might  refer  to  1.  We  say 
then  that  the  “interpretation”  of  ^  is  1.  The  object  in  our  interpretation  that  captures  this 
idea  is  a  function  that  includes  <  au  1>. 

The  interpretation  of  each  predicate  relates  each  predicate  to  a  set  of  ordered  sets  of  objects 
from  our  domain.  Each  predicate  of  arity  n  is  related  to  a  set  of  ordered  sets  of  n  things.  We 

can  write  Dn  for  all  the  ordered  sets  of  n  elements  from  our  domain.  Then,  each  predicate 
of  arity  n  has  as  an  interpretation  some  (not  necessarily  all)  of  the  relations  in  Dn.  For 
example,  an  arity  two  predicate  F,  might  be  meant  to  capture  the  sense  of  “...  is  less  than 
or  equal  to. . Then  the  interpretation  of  Fi  is  a  function  from  Fj  to  a  set  of  ordered  pairs 
from  our  domain  of  discourse,  including  such  examples  as  {...  <1,  2>,  <1,  1000>,  <8,  10>, 

. . . }  and  so  on. 

We  then  say  that  a  sentence  like  F ,  a2a3  is  true  if  the  interpretation  for  a2  and  the 
interpretation  for  a3  are  in  the  interpretation  for  F,.  So,  if  a2  is  2,  and  a3  is  3,  then 
F i a2a3  would  be  true  because  <2,  3>  will  be  in  the  interpretation  for  F,. 

We  will  need  an  interpretation  for  the  quantifiers.  Without  going  into  the  details  of  how 
we  can  describe  this  rigorously  (Tarski  uses  a  set  of  objects  called  “sequences”),  the  idea  will 
be  that  if  we  could  put  any  name  into  the  bound  variable’s  position,  and  get  a  sentence  that 
is  true,  the  universally  quantified  statement  is  true.  To  return  to  our  example:  assuming 
again  that  our  domain  of  discourse  is  natural  numbers,  so  there  is  a  least  number  1,  then 
the  sentence  VxjFfliXj  would  be  true  if  F^xj  would  be  true  for  any  number  we  put  in  for 
Xj.  Suppose  that  the  interpretation  of  a2  is  1.  Then  F^xj  would  be  true  no  matter  what 
element  from  our  domain  of  discourse  we  took  xj  to  be  referring  to,  because  1  is  less  than 
or  equal  to  every  number  is  our  domain  of  discourse.  Therefore,  VxjFfljXj  would  be  true. 
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We  need  only  add  the  usual  interpretation  for  negation  and  the  conditional  and  we  have  a 
semantics  for  the  first  order  logic. 

This  is  all  very  brief,  but  it  explains  the  spirit  of  the  semantics  that  is  standard  for 
contemporary  logic.  Much  more  can  be  said  about  formal  semantics,  and  hopefully  you 
will  feel  encouraged  to  study  further. 

17.7  Modal  logic 

Philosophers  have  always  been  interested  in  questions  of  time  and  possibility,  and  have 
often  thought  that  these  were  intimately  related.  In  his  book,  De  Interpretatione  (titles  were 
given  to  Aristotle’s  books  by  later  editors — he  wrote  in  Greek,  and  did  not  name  his  book 
with  Latin  titles),  Aristotle  wonders  about  the  following.  Consider  a  sentence  like  “There 
will  be  a  sea  battle  tomorrow”.  Suppose  our  prediction  turns  out  true;  is  it  necessarily  true 
when  it  is  made  as  a  prediction?  This  seems  absurd.  That  is,  there  might  be  a  sea  battle 
tomorrow,  but  it  does  not  seem  that  there  must  be  a  sea  battle  tomorrow. 

For  there  is  nothing  to  prevent  someone’s  having  said  ten  thousand  years  beforehand 
that  this  would  be  the  case,  and  another’s  having  denied  it;  so  that  whichever  the  two 
was  true  to  say  then,  will  be  the  case  of  necessity.  (I8b35)[15] 

Here  is  Aristotle’s  worry.  Suppose  tomorrow  is  June  16,  2014.  Necessarily  there  either 
will  or  will  not  be  a  sea  battle  tomorrow  (let  us  assume  that  we  can  define  “sea  battle”  and 
“tomorrow”  well  enough  that  they  are  without  vagueness  or  ambiguity).  If  there  is  a  sea 
battle  tomorrow,  is  the  person  who  said  today,  or  even  ten  thousand  years  ago,  “There  will 
be  a  sea  battle  on  June  16,  2014”  necessarily  right?  That  is,  if  there  is  a  sea  battle  tomorrow, 
is  it  necessary  that  there  is  a  sea  battle  tomorrow? 

Aristotle  concludes  that,  if  there  is  a  sea  battle  tomorrow,  it  is  not  necessary  now. 

What  is,  necessarily  is,  when  it  is;  and  what  is  not,  necessarily  is  not,  when  it  is  not. 
But  not  everything  that  is,  necessarily  is;  and  not  everything  that  is  not,  necessarily 
is  not....  It  is  necessary  for  there  to  be  or  not  to  be  a  sea-battle  tomorrow;  but  it 
is  not  necessary  for  a  sea-battle  to  take  place  tomorrow,  nor  for  one  not  to  take 
place — though  it  is  necessary  for  one  to  take  place  or  not  to  take  place.  (I9b30) 

Aristotle’s  reasoning  seems  to  be,  from  “necessarily  (P  v  -iP)”,  we  should  not  conclude: 
“necessarily  P  or  necessarily  ->P”. 

Philosophers  would  like  to  be  able  to  get  clear  about  these  matters,  so  that  we  can  study 
and  ultimately  understand  necessity,  possibility,  and  time.  For  this  purpose,  philosophers 
and  logicians  have  developed  modal  logic.  In  this  logic,  we  have  most  often  distinguished 
possibility  from  time,  and  treated  them  as — at  least  potentially — independent.  But  that  does 
not  mean  that  we  might  not  ultimately  discover  that  they  are  essentially  related. 

In  this  section,  we  will  describe  propositional  modal  logic;  it  is  also  possible  to  combine 
modal  logic  with  first  order  logic,  to  create  what  is  sometimes  called  “quantified  modal 
logic”.  Our  goal  however  is  to  reveal  the  highlights  and  basic  ideas  of  modal  logic,  and  that 
is  easiest  with  the  propositional  logic  as  our  starting  point. 
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Thus,  we  assume  our  axiomatic  propositional  logic — so  we  have  axioms  (Ll),  (L2),  (L3); 
modus  ponens;  and  direct  derivation.  We  also  introduce  a  new  element  to  the  language, 
alone  with  atomic  sentences,  the  conditional,  and  negation:  “necessary”,  which  we  write 
as  The  syntax  of  this  operator  is  much  like  the  syntax  of  negation:  if  $  is  a  sentence, 
then 


□$> 

is  a  sentence.  We  read  this  as  saying,  “necessarily  $>”  or  “it  is  necessary  that  3>”.  The 
semantics  for  necessity  are  a  bit  too  complex  to  be  treated  fairly  in  this  overview.  However, 
one  intuitive  way  to  read  “necessarily  $>”  is  to  understand  it  as  saying,  in  every  possible  world 
0  is  true.  Or:  in  every  possible  way  the  world  could  be,  0  is  true. 

It  is  useful  to  introduce  the  concept  of  possibility.  Fortunately,  it  appears  reasonable  to 
define  possibility  using  necessity.  We  define  “possible”  to  mean,  not  necessarily  not.  We  use 
the  symbol  “0”  for  possible. 

Thus,  if  $  is  a  sentence,  we  understand 

00 

to  be  a  sentence.  Namely,  this  is  the  sentence, 


We  read  “0$”  as  saying,  it  is  possible  that  0.  One  intuitive  semantics  is  that  it  means  in  at 
least  one  possible  world,  0  is  true',  or:  in  at  least  one  possible  way  the  world  could  be,  0  is  true. 

The  difficult  and  interesting  task  before  us  is  to  ask,  what  axioms  best  capture  the  nature 
of  necessity  and  possibility?  This  may  seem  an  odd  way  to  begin,  but  in  fact  the  benefit 
to  us  will  come  from  seeing  the  consequence  of  various  assumptions  that  we  can  embody 
in  different  axioms.  That  is,  our  choice  of  axioms  results  in  a  commitment  to  understand 
necessity  in  a  particular  way,  and  we  can  discover  then  the  consequences  of  those 
commitments.  Hopefully,  as  we  learn  more  about  the  nature  of  necessity  and  possibility, 
we  will  be  able  to  commit  to  one  of  these  axiomatizations;  or  we  will  be  able  to  improve 
upon  them. 

All  the  standard  axiomatizations  of  modal  logic  include  the  following  additional  rule,  which 
we  will  call  “necessitation”. 

0  I—  $ 


0  1—  D0 

This  adds  an  element  of  our  metalanguage  to  our  rule,  so  let’s  be  clear  about  what  it  is 
saying.  Remember  that  “0  I —  3>”  says  that  one  can  prove  $  in  this  system,  without  premises 
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(there  is  only  an  empty  set  of  premises  listed  to  the  left  of  the  turnstile).  In  other  words,  “0 
I —  asserts  that  <t>  is  a  theorem.  The  necessitation  rule  thus  says,  if  $  is  a  theorem,  then 
□$>.  The  motivation  for  the  rule  is  hopefully  obvious:  the  theorems  of  our  propositional 
logic  are  tautologies.  Tautologies  are  sentences  that  must  be  true.  And  “must”  in  this 
description  hopefully  means  at  least  as  much  as  does  “necessarily”.  So,  for  our  propositional 
logic,  the  theorems  are  all  necessarily  true. 

Different  axiomatizations  of  modal  propositional  logic  have  been  proposed.  We  will 
review  four  here,  and  discuss  the  ideas  that  underlie  them. 

The  most  basic  is  known  variously  as  “M”  or  “T”.  It  includes  the  following  additional 
axioms: 

(Ml)  (□$>  -  $) 

(M2)  (□($  -»  W)  -»  (□$>  ->  tAF)) 

Both  are  intuitive.  Axiom  (Ml)  says  that  if  $  is  necessary,  then  5>.  From  a  necessary  claim 
we  can  derive  that  the  claim  is  true.  Consider  an  example:  if  necessarily  5  >  2,  then  5  >  2. 

Axiom  (M2)  says  that  if  it  is  necessary  that  $  implies  'T,  then  if  $>  is  necessary,  lF  is 
necessary. 

An  extension  of  this  system  is  to  retain  its  two  axioms,  and  add  the  following  axiom: 

(M3)  ($  -»  □<>$>) 

The  resulting  system  is  often  called  “Brouwer,”  after  the  mathematician  Luitzen  Brouwer 
(1881-1966). 

Axiom  (M3)  is  more  interesting,  and  perhaps  you  will  find  it  controversial.  It  says  that 
if  $  is  true,  then  it  is  necessary  that  $  is  possible.  What  people  often  find  peculiar  is  the 
idea  that  a  possibility  could  be  necessary.  On  the  other  hand,  consider  Aristotle’s  example. 
Suppose  that  there  is  a  sea  battle  today.  Given  that  it  actually  is  happening,  it  is  possible 
that  it  is  happening.  And,  furthermore,  given  that  it  is  happening,  is  it  not  the  case  that  it 
must  be  possible  that  it  is  happening?  Such,  at  least,  is  one  possible  motive  for  axiom  (M3). 

More  commonly  adopted  by  philosophers  are  modal  systems  S4  or  S5.  These  systems 
assume  (Ml)  and  (M2)  (but  not  (M3)),  and  add  one  additional  axiom.  S4  adds  the  following 
axiom: 


(M4)  (□$>  ->  □□$) 

This  tells  us  that  if  $  is  necessary,  then  it  is  necessary  that  <t>  is  necessary. 

In  S4,  it  is  possible  to  prove  the  following  theorems  (that  is,  these  are  consequences  of  (Ml), 
(M2),  and  (M4)): 


(□$>  <->  □□$>) 
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(0$  «  00$) 

The  modal  system  S5  instead  adds  to  M  the  following  axiom: 

(M5)  (0$  -»  D0$) 

This  axiom  states  that  if  something  is  possible,  then  it  is  necessary  that  it  is  possible.  This 
is  often  referred  to  as  the  “S5  axiom”.  It  is  perhaps  the  most  controversial  axiom  of  the 
standard  modal  logic  systems.  Note  these  interesting  corollary  theorems  of  S5: 

(□$>  0n$) 

(0$  n0$) 

These  modal  logics  are  helpful  in  clarifying  a  number  of  matters.  Let’s  consider  several 
examples,  starting  with  questions  about  meaning  in  a  natural  language. 

Many  philosophers  pursued  modal  logic  in  the  hopes  that  it  would  help  us  understand  and 
represent  some  features  of  meaning  in  a  natural  language.  These  include  attempting  to 
better  capture  the  meaning  of  some  forms  of  “if. .  .then. . .  ”  expressions  in  a  natural  language, 
and  also  the  meaning  of  predicates  and  other  elements  of  a  natural  language. 

A  problem  with  some  “if. .  .then. . .”  expressions  concerns  that  sometimes  what  appears  to  be 
a  conditional  is  not  well  captured  by  the  conditional  in  our  propositional  logic.  In  chapter 
3,  we  had  an  example  that  included  the  sentence  “If  Miami  is  the  capital  of  Kansas,  then 
Miami  is  in  Canada”.  This  sentence  is  troubling.  Interpreting  the  conditional  as  we  have 
done,  this  sentence  is  true.  It  is  false  that  Miami  is  the  capital  of  Kansas,  and  it  is  false  that 
Miami  is  in  Canada.  A  conditional  with  a  false  antecedent  and  a  false  consequent  is  true. 

However,  some  of  us  find  that  unsatisfactory.  We  have  recognized  from  the  beginning 
of  our  study  of  logic  that  we  were  losing  some,  if  not  much,  of  the  meaning  of  a  natural 
language  sentence  in  our  translations.  But  we  might  want  to  capture  the  meaning  that  is 
being  lost  here,  especially  if  it  affects  the  truth  value  of  the  sentence.  Some  people  want  to 
say  something  like  this:  Kansas  is  in  the  United  States,  so  if  it  were  true  that  Miami  were 
the  capital  of  Kansas,  then  Miami  would  be  in  the  United  States,  not  in  Canada.  By  this 
reasoning,  this  sentence  should  be  false. 

It  seems  that  what  we  need  is  a  modal  notion  to  capture  what  is  missing  here.  We  want 
to  say,  if  Miami  were  in  Kansas,  then  it  would  be  in  the  United  States.  Some  philosophers 
thought  we  could  do  this  by  reading  such  sentences  as  implicitly  including  a  claim  about 
necessity.  Let  us  fix  the  claim  that  Kansas  is  in  the  United  States,  and  fix  the  claim  that 
anything  in  the  United  States  is  not  in  Canada,  but  allow  that  Miami  could  be  the  capital  of 
other  states.  In  other  words,  let  us  suppose  that  it  is  necessary  that  Kansas  is  in  the  United 
States,  and  it  is  necessary  that  anything  in  the  United  States  is  not  in  Canada,  but  possible 
that  Miami  is  the  capital  of  Kansas.  Then  it  appears  we  could  understand  the  troubling 
sentence  as  saying, 

Necessarily,  if  Miami  is  the  capital  of  Kansas,  then  Miami  is  in  Canada. 
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Assuming  an  implicit  key,  we  will  say, 

□(P1-P2) 

This  seems  to  make  some  progress  toward  what  we  were  after.  We  take  the  sentence  “if 
Miami  is  the  capital  of  Kansas,  then  Miami  is  in  Canada"  to  mean:  in  any  world  where  Miami 
is  in  Kansas,  then  in  that  world  Miami  is  in  Canada.  But,  given  the  assumptions  we  have  made 
above,  this  sentence  would  be  false.  There  would  be  worlds  where  Miami  is  the  capital 
of  Kansas,  and  Miami  is  not  in  Canada.  This  at  least  seems  to  capture  our  intuition  that 
the  sentence  is  (on  one  reading)  false.  There  are  further  subtleties  concerning  some  uses 
of  “if. ..  then....”  in  English,  and  it  is  not  clear  that  the  analysis  we  just  gave  is  sufficient, 
but  one  can  see  how  we  appear  to  need  modal  operators  to  better  capture  the  meaning  of 
some  utterances  of  natural  languages,  even  for  some  utterances  that  do  not  appear  (at  first 
reading)  to  include  modal  notions. 

Another  problem  concerning  meaning  has  to  do  with  attempts  to  better  capture  the 
meaning  of  more  fundamental  elements  of  our  language,  such  as  our  predicates.  Our  first 
order  logics  have  what  philosophers  call  an  “extensional  semantics”.  This  means  that  the 
meaning  of  terms  is  merely  their  referent,  and  the  meaning  of  predicates  is  the  sum  of  things 
that  they  are  true  of  (see  section  17.6).  However,  upon  reflection,  this  seems  inadequate  to 
describe  the  meaning  of  terms  and  predicates  in  a  natural  language. 

Consider  predicates.  The  philosopher  Rudolf  Carnap  (1891-1970)  used  the  following 
example;  suppose  that  human  beings  are  the  only  rational  animals  (“rational”  is  rather  hard 
to  define,  but  if  we  have  a  strong  enough  definition — language  using,  capable  of  some 
mathematics,  can  reason  about  the  future — this  seems  that  it  could  be  true,  assuming  by 
“animal”  we  mean  Terrestrial  metazoans).1 [16]  Then  the  following  predicates  would  be  true 
of  all  and  only  the  same  objects: 

...  is  a  rational  animal. 

...  is  a  human. 

That  means  the  extension  of  these  predicates  would  be  the  same.  That  is,  if  we  corralled  all 
and  only  the  rational  animals,  we  would  find  that  we  had  corralled  all  and  only  the  humans. 
If  the  meaning  of  these  predicates  were  determined  by  their  extensions,  then  they  would 
have  the  same  meaning.  So  the  sentence: 

All  and  only  humans  are  rational  animals. 

would  be  sufficient,  in  an  extensional  semantics,  to  show  that  these  predicates  mean  the 
same  thing. 

But  obviously  these  predicates  do  not  mean  the  same  thing.  How  can  we  improve  our  logic 
to  better  capture  the  meaning  of  such  natural  language  phrases?  The  philosopher  Rudolf 
Carnap  proposed  that  we  use  modal  logic  for  this  purpose.  The  idea  is  that  we  capture 
the  difference  using  the  necessity  operator.  Two  sentences  have  the  same  meaning,  for 
example,  if  necessarily  they  have  the  same  truth  value.  That  is,  “. . .  is  a  human”  and  “. . .  is  a 
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rational  animal”  would  have  the  same  meaning  if  and  only  if  they  necessarily  were  true  of 
all  and  only  the  same  things. 

We  have  not  introduced  a  semantics  for  modal  logic  combined  with  our  first  order  logic. 
However,  the  semantics  we  have  discussed  are  sufficient  to  make  sense  of  these  ideas.  Let 
us  use  the  following  key: 

Ftx:  x  is  a  rational  animal. 

F2x:  x  is  a  human. 

Then  we  can  translate  “All  and  only  rational  animals  are  human”  as: 

Vx(FiX  F2x) 

And  if  the  meaning  of  a  predicate  were  merely  its  extension  (or  were  fully  determined  by  its 
extension)  then  these  two  predicates  F,  and  F2  would  mean  the  same  thing.  The  proposal 
is  that  in  order  to  describe  the  meanings  of  predicates  in  a  natural  language,  we  must  look 
at  possible  extensions.  We  could  then  say  that  F,  and  F2have  the  same  meaning  if  and  only 
if  the  following  is  true: 

□Vx(Fjx  <-»  F2x) 

Much  is  going  to  turn  on  our  semantics  for  these  predicates  and  for  the  necessity  operator, 
but  the  idea  is  clear.  We  see  the  difference  in  the  meaning  between  "...  is  a  human”  and 
“. . .  is  a  rational  animal”  by  identifying  possible  differences  in  extensions.  It  is  possible,  for 
example,  that  some  other  kind  of  animal  could  be  rational  also.  But  then  it  would  not  be 
the  case  that  necessarily  these  two  predicates  are  true  of  the  same  things.  In  a  world  where, 
say,  descendants  of  chimpanzees  were  also  rational,  there  would  be  rational  things  that  are 
not  human. 

The  meaning  of  a  predicate  that  is  distinct  from  its  extension  is  called  its  “intension”. 
We  have  just  described  one  possible  “intensional  semantics”  for  predicates:  the  intension 
of  the  two  predicates  would  be  the  same  if  and  only  if  the  predicates  have  the  same 
extension  in  every  world  (or:  in  every  way  the  world  could  be).  This  seems  to  get  us 
much  closer  to  the  natural  language  meaning.  It  also  seems  to  get  to  something  deep  about 
the  nature  of  meaning:  to  understand  a  meaning,  one  must  be  able  to  apply  it  correctly 
in  new  and  different  kinds  of  situations.  One  does  not  know  beforehand  the  extension  of 
a  predicate,  we  might  argue;  rather,  one  knows  how  to  recognize  things  that  satisfy  that 
predicate — including  things  we  may  not  know  exist,  or  may  even  believe  do  not  exist. 

Many  philosophers  and  others  who  are  studying  semantics  now  use  modal  logic  as  a 
standard  tool  to  try  to  model  linguistic  meanings. 

Modal  logics  have  important  uses  outside  of  semantics.  Here  are  two  problems  in 
metaphysics  that  it  can  help  clarify.  Metaphysics  is  that  branch  of  philosophy  that  studies 
fundamental  problems  about  the  nature  of  reality,  such  as  the  nature  of  time,  mind,  or 
existence. 
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Consider  Aristotle’s  problem:  is  it  the  case  that  if  there  necessarily  is  or  is  not  a  sea  battle 
tomorrow,  then  it  is  the  case  that  necessarily  there  is,  or  necessarily  there  is  not,  a  sea  battle 
tomorrow?  In  each  of  the  systems  that  we  have  described,  the  answer  is  no.  No  sentence 
of  the  following  form  is  a  theorem  of  any  of  the  systems  we  have  described: 

(□($>  v  -.$>)  ->  (□$>  v  □-.$)) 

It  would  be  a  problem  if  we  could  derive  instances  of  this  claim.  To  see  this,  suppose  there 
were  a  sea  battle.  Call  this  claim  S.  In  each  of  our  systems  we  can  prove  as  a  theorem  that 
(S  -»  OS),  which  by  definition  means  that  (S  -»  -illhS).  (Surely  it  is  a  good  thing  that  we 
can  derive  this;  it  would  be  absurd  to  say  that  what  is,  is  not  possible.)  It  is  a  theorem  of 
propositional  logic  that  (S  v  iS),  and  so  by  necessitation  we  would  have  n(S  v  ->S).  Then, 
with  few  applications  of  modus  ponens  and  modal  logic,  we  would  have  DS. 


1.  (n(Sv->S)  -<■  (□SvchS)) 

the  problematic  premise 

2.  S 

premise 

3.  (S  v  ->S) 

theorem 

4.  n(S  v  -iS) 

necessitation,  3 

5.  (dSvchS) 

modus  ponens,  1,  4 

6.  (S  — *  -iln-iS) 

theorem 

7.  -iD-iS 

modus  ponens  6,  2 

8.  DS 

modus  tollendo  ponens,  5,  7 

But  that  seems  wrong.  It  was  possible,  after  all,  that  there  would  not  be  a  sea  battle.  Worse, 
this  reason  could  be  applied  to  any  claim.  If  some  claim  is  true,  then  it  will  be  necessary. 
So  Aristotle  was  right  to  deny  the  claim  that  (□($>  v  -i$>)  -»  (□$  v  □->$)),  and  fortunately 
this  is  not  a  theorem  of  any  of  our  standard  modal  logic  systems. 

Although  we  agree  with  Aristotle  here,  we  may  disagree  with  him  on  another  point. 
Aristotle  says  that  what  is,  is  necessary.  That  is  not  a  claim  in  any  of  our  logical  systems, 
and  it  seems  right  to  keep  it  out.  Most  likely,  Aristotle  mixed  notions  of  time  and  modality 
together,  in  a  way  that,  at  least  from  our  perspective,  seems  confusing  and  perhaps  likely 
to  lead  to  errors.  That  is,  he  seems  to  have  thought  that  once  something  is  true  now,  that 
makes  it  necessary;  as  if  some  feature  of  time  determined  necessity. 

Here  is  an  interesting  case  from  contemporary  metaphysics  where  the  axiom  (M5)  plays  an 
important  role.  The  philosopher  David  Chalmers  (1966-)  has  argued  that  contemporary 
science  will  not  be  able  to  explain  the  character  of  consciousness — that  is,  nothing  like 
contemporary  science  will  explain  what  it  is  like  to  taste  a  strawberry  or  listen  to 
Beethovenf 17  This  is  not  an  anti-science  argument;  rather,  his  point  is  that  contemporary 
science  does  not  have  the  tools  required,  and  will  ultimately  require  radical  new  additional 
theory  and  methods.  That  has  happened  before:  for  example,  at  one  time  many 
philosophers  hoped  to  explain  all  of  nature  in  terms  of  the  motion  and  impact  of  small, 
rigid  particles.  This  theory,  called  “atomism”,  had  great  trouble  accounting  for  gravity  and 
magnetism.  Ultimately,  we  added  fields  to  our  physical  theories.  From  the  perspective  of 
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an  atomist,  this  would  have  meant  the  overthrow  of  their  view:  one  needed  radical  new 
additions  to  atomism  to  explain  these  phenomena. 

Chalmers  predicts  a  similar  kind  of  revolution  is  required  for  progress  in  the  study  of 
consciousness.  However,  one  of  his  primary  arguments  is  very  controversial.  It  goes  like 
this  (I  revise  the  argument  slightly,  to  avoid  some  technical  terminology): 

1.  We  can  conceive  of  beings  that  are  physically  identical  to  us  but  that  lack 
conscious  experiences. 

2.  If  we  can  conceive  of  beings  that  are  physically  identical  to  us  and  that  lack 
conscious  experiences,  then  it  is  possible  that  there  are  beings  that  are  physically 
identical  to  us  but  that  lack  conscious  experiences. 

3.  If  it  is  possible  that  there  are  beings  that  are  physically  identical  to  us  and  that 
lack  conscious  experiences,  then  physical  sciences  alone  cannot  explain  conscious 
experience. 

4.  The  physical  sciences  alone  cannot  explain  conscious  experience. 

This  argument  is  obviously  valid;  it  requires  only  two  applications  of  modus  ponens  to 
show  line  4. 

But  is  the  argument  sound?  This  is  very  important.  Many  people  study  the  human  mind, 
and  many  more  people  would  benefit  if  we  better  understood  the  mind.  Understanding 
consciousness  would  seem  to  be  an  important  part  of  that.  So,  psychologists,  psychiatrists, 
philosophers,  artificial  intelligence  researchers,  and  many  others  should  care.  But,  if  this 
argument  is  sound,  we  should  start  spending  grant  money  and  time  and  other  resources  on 
radical  new  methods  and  approaches,  if  we  want  to  understand  consciousness. 

Many  philosophers  have  denied  premise  1  of  this  argument.  Those  philosophers  argue  that, 
although  it  is  easy  to  just  say,  “I  can  conceive  of  beings  that  are  physically  identical  to  us 
but  that  lack  conscious  experiences”,  if  I  really  thought  hard  about  what  this  means,  I  would 
find  it  absurd  and  no  longer  consider  it  conceivable.  After  all,  imagine  that  the  people 
weeping  next  to  you  at  a  funeral  feel  nothing,  or  that  a  person  who  has  a  severe  wound 
and  screams  in  pain  feels  nothing.  This  is  what  this  argument  claims  is  possible:  people 
could  exist  who  act  at  all  times,  without  exception,  as  if  they  feel  pain  and  sadness  and  joy 
and  so  on,  but  who  never  do.  Perhaps,  if  we  think  that  through,  we’ll  say  it  is  not  really 
conceivable. 

Premise  2  is  very  controversial,  since  it  seems  that  we  might  fool  ourselves  into  thinking 
we  can  conceive  of  something  that  is  not  possible.  It  seems  to  mix  human  capabilities,  and 
thus  subjective  judgments,  with  claims  about  what  is  objectively  possible. 

Premise  3  turns  on  technical  notions  about  what  it  means  for  a  physical  science  to  explain  a 
phenomenon;  suffice  it  to  say  most  philosophers  agree  with  premise  3. 

But  what  is  interesting  is  that  something  appears  to  be  wrong  with  the  argument  if  we 
adopt  the  S5  axiom,  (M5).  In  particular,  it  is  a  theorem  of  modal  logic  system  S5  that  (□$ 
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0n$).  With  this  in  mind,  consider  the  following  argument,  which  makes  use  of  key  claims 
in  Chalmers’s  argument. 

1.  We  can  conceive  of  it  being  true  that  necessarily  a  being  physically  identical  to 
us  will  have  the  same  conscious  experience  as  us. 

2.  If  we  can  conceive  of  it  being  true  that  necessarily  a  being  physically  identical 
to  us  will  have  the  same  conscious  experience  as  us,  then  it  is  possible  that 
necessarily  a  being  physically  identical  to  us  will  have  the  same  conscious 
experience  as  us. 

3.  It  is  possible  that  necessarily  a  being  physically  identical  to  us  will  have  the  same 
conscious  experience  as  us. 

4.  If  it  is  possible  that  necessarily  a  being  physically  identical  to  us  will  have  the 
same  conscious  experience  as  us,  then  necessarily  a  being  physically  identical  to 
us  will  have  the  same  conscious  experience  as  us. 

5.  Necessarily  a  being  physically  identical  to  us  will  have  the  same  conscious 
experience  as  us. 

Premise  1  of  this  argument  seems  plausible;  surely  it  is  at  least  as  plausible  as  the  claim 
that  we  can  conceive  of  a  being  physically  identical  to  us  that  does  not  have  phenomenal 
experience.  If  we  can  imagine  those  weird  people  who  act  like  they  feel,  but  do  not  feel, 
then  we  can  also  imagine  that  whenever  people  act  and  operate  like  us,  they  feel  as  we  do. 

Premise  2  uses  the  same  reasoning  as  premise  2  of  Chalmers’  argument:  what  is  conceivable 
is  possible. 

Line  3  is  introduced  just  to  clarify  the  argument;  it  follows  from  modus  ponens  of  premises 
2  and  1. 

Line  4  uses  (0n5>  -»  □$>),  which  we  can  derive  in  S5. 

The  conclusion,  line  5,  follows  by  modus  ponens  from  4  and  3.  The  conclusion  is  what 
those  in  this  debate  agree  would  be  sufficient  to  show  that  the  physical  sciences  can  explain 
the  phenomenon  (via  the  same  reasoning  that  went  into  line  3  of  Chalmers’s  argument). 

If  we  accept  modal  system  S5,  then  it  seems  that  there  is  something  wrong  with  Chalmers’s 
argument,  since  the  kind  of  reasoning  used  by  the  argument  can  lead  us  to  contradictory 
conclusions.  Defenders  of  Chalmers’s  argument  must  either  reject  axiom  (M5),  or  deny 
line  1  of  this  new  argument.  As  I  noted  above,  this  matters  because  we  would  like  to  know 
how  best  to  proceed  in  understanding  the  mind;  whether  either  of  these  arguments  is  sound 
would  help  determine  how  best  we  can  proceed. 

17.8  An  example  of  another  axiom  system:  Peano  arithmetic 

We  close  this  brief  introduction  to  the  axiomatic  approach  with  a  look  at  how  we  can 
extend  the  first  order  logic  to  enable  us  to  study  a  familiar  specific  task:  arithmetic. 
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We  have  mentioned  Gottlob  Frege  several  times  in  this  text.  This  mathematician  and 
philosopher  made  essential  contributions  to  first  order  logic,  the  philosophy  of  mathematics, 
and  the  philosophy  of  language.  His  primary  goal  in  all  his  work,  however,  was  to  prove 
that  mathematics  was  an  extension  of  logic.  This  view  was  called  “logicism”. 

Why  would  anyone  be  interested  in  such  a  project?  When  Frege  was  writing,  there  were 
many  disagreements  about  some  fundamental  issues  in  mathematics.  These  included  the 
status  of  set  theory,  which  many  found  very  useful,  but  others  disliked.  Frege’s  proposed 
solution  was  very  elegant.  If  we  could  derive  arithmetic  from  a  handful  of  logical  axioms 
that  everyone  would  find  unobjectionable,  then  we  would  have  a  way  to  settle  all  disputes 
about  arithmetic.  To  put  a  controversial  claim  beyond  criticism,  we  would  have  to  show 
that  we  could  derive  it  directly  from  our  unobjectionable  axioms. 

Unfortunately,  Frege’s  project  failed.  His  ultimate  axiomatization  of  arithmetic  proved  to 
have  a  fatal  flaw.  While  Frege  had  the  final  volume  of  this  project  at  the  press,  he  received 
a  postcard  from  a  young  philosopher,  Bertrand  Russell.  Russell  was  attempting  the  same 
project  as  Frege,  but  he  had  noticed  a  problem.  In  Frege’s  system,  Russell  was  able  to  derive 
a  contradiction.  This  was  devastating:  just  as  in  our  logic,  in  Frege’s  logic  you  could  derive 
anything  from  a  contradiction.  And  so  this  meant  Frege’s  system  was  inconsistent:  it  could 
prove  falsehoods. 

What  Russell  proved  is  now  called  “Russell’s  Paradox”.  It  would  be  better  called  “Russell’s 
Contradiction”,  since  it  is  explicitly  a  contradiction.  The  logical  systems  that  Russell  and 
Frege  used  had  special  entities  called  “classes”,  but  given  that  most  people  today  are  more 
familiar  with  sets,  we  can  restate  Russell’s  Contradiction  with  sets.  Russell  showed  that  in 
Frege’s  system  he  could  derive  a  sentence  that  meant  something  like:  there  is  a  set  of  all  sets 
that  do  not  belong  to  themselves. 

Think  about  this  for  a  moment.  Does  this  set  belong  to  itself?  If  it  does,  it  should  not, 
because  it  is  supposedly  the  set  of  all  and  only  those  things  that  do  not  belong  to  themselves. 
Thus,  it  should  not  belong  to  itself.  But  then,  it  would  be  one  of  those  sets  that  do  not 
belong  to  themselves,  and  so  it  should  belong  to  itself.  We  have  a  contradiction. 

Frege  was  devastated.  From  our  perspective,  this  is  unfortunate,  because  his 
accomplishments  were  already  so  great  that  he  need  not  have  felt  this  error  was  a  refutation 
of  all  his  work. 

Logicism  did  not  die.  Russell  found  a  way  to  avoid  the  contradiction  using  a  kind  of  logical 
system  called  “type  theory”,  and  he  pursued  the  construction  of  arithmetic  in  his  huge  book, 
written  with  Alfred  Whitehead,  Principia  Mathematica.  This  book’s  system  is,  unfortunately, 
much  more  cumbersome  than  Frege’s  system,  and  earned  few  converts  to  logicism.  Today, 
however,  there  are  still  mathematicians  who  believe  that  type  theory  may  be  the  best  way 
to  understand  mathematics. 

More  often,  however,  treatments  of  the  foundations  of  arithmetic  turn  directly  to  an 
axiomatization  that  includes  explicitly  mathematical  elements.  The  most  familiar  of  these 
is  typically  called  “the  Peano  axioms”,  although  these  are  significantly  modified  from  the 
version  of  Giuseppe  Peano  (1858-1932)  after  whom  they  are  named.  Peano  himself  cites 
Richard  Dedekind  (1831-1916)  as  his  source.  A  standard  version  of  Peano’s  axioms  is 
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surprisingly  compact,  and  proves  sufficient  to  do  all  the  things  we  want  arithmetic  to  do. 
The  axioms,  however,  clearly  assume  and  specify  things  about  numbers  and  operations  on 
numbers.  We  sometimes  call  such  axioms  “non-logical  axioms”  or  “proper  axioms”,  not 
because  there  is  something  irrational  about  them,  but  because  they  clearly  aim  to  capture 
some  rich  notion  that  may  appear  to  be  more  specific  than  a  highly  general  logical  notion. 

Here  is  one  possible  set  of  axioms  for  arithmetic,  restricted  to  addition  and  multiplication. 

We  assume  all  the  axioms  of  the  first  order  logic  described  in  section  17.6  (that  is,  we  have 
(Ll)  through  (L5)).  We  have  a  single  special  term,  0.  We  have  three  special  function  letters 
in  our  language,  f,  and  f2,  which  are  arity  two,  and  f3,  which  is  arity  one.  f,  and  f2  are  to 
be  given  an  interpretation  consistent  with  addition  and  multiplication.  f3  is  the  successor 
function;  think  of  it  as  adding  one  to  any  number.  We  have  an  arity-two  predicate  Pi, 
which  we  interpret  as  we  would  “=”.  However,  for  reasons  of  familiarity,  we  will  write  “+” 
and  and  “=”.  For  the  successor  function  we  will  use  Thus  we  write  the  successor  of 
0  by  writing  O'.  The  successor  of  0  is  of  course  1.  The  successor  of  1  is  2,  expressed  0",  and 
so  on. 

With  all  that  in  mind,  one  such  axiom  system  is: 


(Al) 

(xj  =  X2  -*  x/  =  x2) 

(A2) 

(x/  =  X2'  -»  Xi  =  x2) 

(A3) 

(xt  =  X2  -»  ( Xt  =  x3  ->  x2  =  x3)) 

(A4) 

J 

* 

II 

o 

(A5) 

Xi  +  0  =  xt 

(A6) 

Xi  +  x2  =  (xt  +  x2)' 

(A7) 

o 

II 

o 

£ 

(as) 

Xj  •  X2  =  (xj  •  x2)  +  X] 

(Remember  that  “+”  and  are  used  here  only  as  shorthand  for  our  functions;  in  our 
object  language  we  do  not  have  infix  notation  and  we  do  not  need  the  disambiguating 
parentheses  shown  here  for  axioms  (A6)  and  (A8).  That  is,  axiom  (A6),  for  example,  is 
actually  V  ^x  tx2 .  Looking  at  that  sentence,  you  will  no  doubt  realize  why  it  is 

very  helpful  to  switch  over  to  our  usual  way  of  writing  these  things.) 

We  can  also  make  mathematical  induction  an  explicit  part  of  this  system  (instead  of  a 
principle  of  our  metalogical  reasoning  alone),  and  include  it  as  an  axiom: 

(A9)  ($(0)  -  (Vx,(^(x,)  -<t(x,-'))  -Vx,<P(x,))) 

These  axioms  are  sufficient  to  do  everything  that  we  expect  of  arithmetic.  This  is  quite 
remarkable,  because  arithmetic  is  very  powerful  and  flexible,  and  these  axioms  are  few  and 
seemingly  simple  and  obvious.  Since  these  explicitly  formulate  our  notions  of  addition, 
multiplication,  and  numbers,  they  do  not  achieve  what  Frege  dreamed;  he  hoped  that  the 
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axioms  would  be  more  general,  and  from  them  one  would  derive  things  like  addition 
and  multiplication.  But  this  is  still  a  powerful  demonstration  of  how  we  can  reduce  great 
disciplines  of  reasoning  to  compact  fundamental  principles. 

Axiomatizations  like  this  one  have  allowed  us  to  study  and  discover  shocking  things  about 
arithmetic,  the  most  notable  being  the  discovery  by  the  logician  Kurt  Godel  (1906-1978) 
that  arithmetic  is  either  incomplete  or  inconsistent. 

In  closing  our  discussion  of  axiomatic  systems,  we  can  use  this  system  to  prove  that  1+1=2. 
We  will  use  the  indiscernibility  of  identicals  rule  introduced  in  15.3.  We  start  by  letting 
x3  be  0'  and  x2be  0  to  get  the  instance  of  axiom  (A6)  on  line  1,  and  the  instance  of  (A5)  on 
line  2. 


1. 

0'  +  0'  =  (O'  +  0)' 

axiom  (A6) 

2. 

0'  +  0  =  0' 

axiom  (A5) 

3. 

0'  +  0'  =  0" 

indiscernibility  of  indenticals,  1,  2 

Because  line  2  tells  us  that  0'  +  0  and  0'  are  the  same,  using  indiscernibility  of  identicals  we 
can  substitute  0'  for  0'  +  0  in  the  right  hand  side  of  the  identity  in  line  1.  Remember  that 
the  parentheses  are  there  only  to  disambiguate  our  shorthand;  this  means  (O')'  and  0"  are 
actually  the  same. 


17.9  Problems 

1.  Use  a  truth  table  to  show  that  the  following  sentences  are  equivalent  (using  your 
natural  deduction  system  definitions  of  the  connectives,  as  given  in  part  I  of  the  book). 

a.  (5>AW)  and 

b.  (^UP)  and  (-^-»^). 

c.  ($  ~  W)  and 

2.  Prove  the  following  arguments  are  valid,  using  only  our  axioms  of  the  axiomatic 
propositional  logic,  modus  ponens,  and  the  deduction  theorem.  For  some  of  these 
problems,  you  will  need  to  translate  some  of  the  sentences  as  required  into  the 
corresponding  formula  with  only  negation  and  conditional  as  our  connectives  (follow 
the  interpretation  you  studied  in  problem  l). 

a.  {(PiaP2)}I-Pi 

b.  {(-P2,  (P3-P2)}l-  -Pi 

C.  {(PiVPd^Pjl-Pi 

d.  {(P1-(P2  -P3))}l—  (P2-  (Pi  -P3)) 

3.  Describe  a  function  that  is  on  A  and  onto  B  and  also  1-to-l,  so  that  we  know  that  lAl  = 
iBl,  for  each  of  the  following.  Your  function  can  be  stated  in  informal  mathematical 
notation.  Remember  that  the  natural  numbers  are  {1,2,3...}  and  the  integers  are  all 
the  whole  numbers,  both  negative  and  positive,  and  including  0. 
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a.  A  is  the  even  natural  numbers,  B  is  the  odd  natural  numbers. 

b.  A  is  the  natural  numbers,  and  B  is  the  numbers  {10,  11,  12...}. 

c.  A  is  the  natural  numbers,  B  is  the  negative  integers. 

4.  Prove  the  following  claims.  Your  proof  can  use  the  definitions  given  in  17.5  and  any 
theorems  of  our  first  order  logic,  and  can  be  stated  in  English. 

a.  For  any  sets  A  and  B,  ({A}={B}  A=B). 

b.  For  any  sets  A  and  B,  (AcB  <-»  AnB=A). 

c.  For  any  sets  A  and  B,  (AcB  <-»  AuB=B). 

d.  For  any  set  A,  (A 00=0). 

5.  Prove  the  following,  keeping  as  resources  the  propositional  logic  and  the  specified 
modal  system. 

a.  InM: 

b.  In  S4:  (□$«□□$). 

c.  In  S4: 

d.  In  S5: 

For  these  problems  and  the  next,  you  will  find  they  are  much  easier  if  you  allow  yourself 
two  rules  sometimes  added  to  natural  deduction  systems.  These  are:  (l)  Contraposition, 
which  allows  one  to  assert  that  (-ivF^-i<t>)  given  (<K-W);  and  (2)  a  relaxing  of  double 
negation  that  allows  one  to  remove  any  double  negation  from  anywhere  in  a  formula,  or 
insert  any  double  negation  anywhere  in  a  formula  as  long  as  the  result  is  well  formed. 

6.  Show  that  S5  is  equivalent  to  the  combination  of  Brouwer  and  S4.  You  can  do  this  by 
showing  that  from  the  axioms  of  Brouwer  ((Ml),  (M2),  and  (M3))  and  S4  (axiom  (M4)) 
you  can  derive  the  additional  axioms  of  S5  (in  this  case,  just  (M5)).  Then  show  that 
from  the  axioms  of  S5  ((Ml),  (M2),  and  (M5))  you  can  derive  the  additional  axioms  of 
Brouwer  and  S4  (that  is,  (M3)  and  (M4)). 

7.  Prove  the  following  theorems,  using  only  our  axioms  of  the  axiomatic  first  order  logic, 
modus  ponens,  ana  the  deduction  theorem. 

a.  Vxi(Pi(xi)  -»  P2(x,))  -»  (Vx|P,(x,)  -^Vx|P2(xi)) 

b.  Vx1(-.P2(x1)  ->  ->Pi(x1))  ^  Vx1(P1(x1)  ^  P2(xi)) 

c.  Vx^P^Xj)  >-,P2(xi))  ->  Vx(P2(x!)  ->  — ,Pi(x1)) 

d.  Vxi-i(P1(x1)  -»  ->P2(x1))  -»  VxiP^Xi) 

8.  Using  only  first  order  logic,  the  Peano  axioms,  and  indiscernibility  of  identicals,  prove 
that 

a.  1+0  =  0+1 

b.  2+2  =  4 

c.  2-2  =  4 

d.  2-2  =  2+2 
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[13]  A  good  source  is  (Mendelson  2010:  32-34). 

[14]  See  Tarski  (1956). 

[15]  This  translation  is  based  on  that  in  (Barnes  1984:  29). 

[16]  See  Camap  (1956). 

[17]  See  Chalmers  (1996). 
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